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Abstract 

We give the first polylogarithmic-competitive randomized online algorithm for the A:-server 
problem on an arbitrary finite metric space. In particular, our algorithm achieves a competitive 
ratio of 0(log'^ nlog^ A:) for any metric space on n points. Our algorithm improves upon the 
deterministic {2k — l)-competitive algorithm of Koutsoupias and Papadimitriou [23] whenever 
n is sub-exponential in k. 

1 Introduction 

The fe-server problem is one of the most fundamental and extensively studied problems in online 
computation. Suppose there is an re-point metric space and k servers are located at some of the 
points of the metric space. At each time step, an online algorithm is given a request at one of the 
points of the metric space, and this request is served by moving a server to the requested point (if 
there is no server there already). The cost of serving a request is defined to be the distance traveled 
by the server. Given a sequence of requests, the task is to devise an online strategy minimizing the 
sum of the costs of serving the requests. 

The fc-server problem was originally proposed by Manasse et al. [24] as a broad generalization of 
various online problems. The most well studied problem among them is the paging (also known as 
caching) problem, in which there is a cache that can hold up to k pages out of a universe of re pages. 
At each time step a page is requested; if the page is already in the cache then no cost is incurred, 
otherwise it must be brought into the cache (possibly causing an eviction of some other page) at 
a cost of one unit. It is easy to see that the paging problem is equivalent to the fc-server problem 
on a uniform metric space, and already in their seminal paper on competitive analysis, Sleator 
and Tarjan [27] gave several fc-competitive algorithms for paging, and showed that no deterministic 
algorithm can do better. This prompted Manasse et al. [24] to state a far-reaching conjecture that 
a similar result holds for an arbitrary metric. More precisely, they conjectured that there is a 
/c-competitive online algorithm for the /c-server problem on any metric space and for any value of 
k. This conjecture is known as as the k-server conjecture. 
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At the time that the fc-server conjecture was stated, an onUne algorithm with competitive 
ratio that depends only on k was not known. It was first obtained by Fiat et al. [19]. Improved 
bomids were obtained later on by [22, 9], though the ratio still remained exponential in k. A major 
breakthrough was achieved by Koutsoupias and Papadimitriou [23] , who showed that so-called work 
function algorithm is {2k — l)-competitive. This result is almost optimal, since we know that any 
deterministic algorithm has to be at least /c-competitive. We note that a tight competitive factor 
of k is only known for special metrics such as the uniform metric, line metric, and - more generally 
- trees [14, 15]. 

Even though the aforementioned results are all deterministic, there is also a great deal of interest 
in randomized algorithms for the /c-server problem. This is motivated primarily by the fact that 
randomized online algorithms (i.e., algorithms working against an oblivious adversary) tend to 
have much better performance than their deterministic counterparts. For example, for the paging 
problem, several 0(log A;)-competitive algorithms are known [20, 25, 1, 2], as well as a lower bound 
of ri(logfc) on the competitive ratio. 

Unfortunately, our understanding of the /c-server problem when randomization is allowed is 
much weaker than in the deterministic case. Despite much work [12, 8, 10], no better lower bound 
than ri(log k) is known on competitive factors in the randomized setting. Conversely, no better 
upper bound, other than the deterministic guarantee of 2/c — 1 [23] mentioned above, is known 
for general metrics. Thus, an exponential gap still remains between the known lower and upper 
bounds. 

Given the lack of any lower bounds better than Q{\ogk), it is widely believed that there is an 
0(log A;)- competitive randomized algorithm for the fc-server problem on every metric space against 
an oblivious adversary - this belief is captured by the so-called randomized k-server conjecture. 
Unfortunately, besides the previously-mentioned 0(log A;)-competitive algorithm for the case of a 
uniform metric, even when we allow the competitiveness to depend on other parameters of the 
metric, such as the number of points n, or the diameter A, non-trivial guarantees are known only 
for very few special cases. For example, the case of a well-separated metric [26], the case of a metric 
corresponding to a binary HST with high separation [16], the case of n = A; + 0(1) [7], as well as 
some other cases [17, 3, 4]. For the weighted paging problem^, [2] gave an 0(log A;)-competitive 
algorithm (see also [3]) which is based on the online primal-dual approach. However, no non-trivial 
guarantees are known even for very simple extensions of the uniform metric, e.g., two-level HSTs 
with high separation. 

For a more in-depth treatment of the extensive literature on both paging and the /c-server 
problem, we suggest [13]. 

1.1 Our Result 

We give the first polylogarithmic competitive algorithm for the /c-server problem on a general metric 
with a finite number of points n. More precisely, our main result is the following. 

Theorem 1. There is a randomized algorithm for the k-server problem that achieves a competitive 
ratio o/ 0(log^ /c log^ nloglogn) = 0(log^ /c log'^ n) on any metric space on n points. 

The starting point of our algorithm is the recent approach proposed by Cote et al. [16] for 
solving the /c-server problem on hierarchically well-separated trees (HSTs). It is well known that 

^In weighted paging, arbitrary weights are associated with fetching the pages into tlie cache. This problem 
corresponds to the fc-server problem on a weighted star. 
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solving the problem on HSTs suffices, as any metric space can be embedded into a probability 
distribution over HSTs with low distortion [18]. 

More precisely, Cote et al. defined a problem on uniform metrics which we call the allocation 
problem. They showed that an online randomized algorithm for the allocation problem that provides 
certain refined competitive guarantees can be used as a building block to recursively solve the k- 
server problem on an HST, provided the HST is sufficiently well-separated. Roughly speaking, in 
their construction, each internal node of the HST runs an instance of the allocation problem that 
determines how to distribute the available servers among its children nodes. Starting from the 
root, which has k servers, the recursive calls to the allocation problem determine the number of 
servers at each leaf of the HST, giving a valid A:-server solution. The guarantee of this /c-server 
solution depends on both the guarantees for the allocation problem, as well as the depth of the 
HST (i.e., the number of levels of recursion). The guarantees obtained by Cote et al. [16] for the 
allocation problem on a metric space with two points allowed them to obtain an algorithm for the 
/c-server problem on a sufficiently well-separated binary HST having a competitive ratio that is 
poly logarithmic in k, n, and the diameter A of the underlying metric space. Unfortunately, the 
fact that the HST has to be binary as well as have a sufficiently good separation severely restricts 
the metric spaces to which this algorithm can be applied. 

Given the result of Cote et al. [16], a natural approach to establishing our result is coming 
up with a randomized algorithm having the required refined guarantees for the allocation problem 
on an arbitrary number of points. However, it is unclear to us how to obtain such an algorithm. 
Instead, we pursue a more refined approach to solving the fc-server problem via the allocation 
problem. By doing so we are able to bypass the need for a "true" randomized algorithm for the 
allocation problem and are able to work with a (much) weaker formulation. More precisely, our 
result consists of three main parts. 

1. We show that instead of obtaining a randomized algorithm for the allocation problem, it suf- 
fices to obtain an algorithm for a certain fractional relaxation of it. Employing this relaxation 
makes the task of designing such a fractional allocation algorithm easier than designing the 
version of the allocation problem that was considered earlier. Next, building upon the argu- 
ments in Cote et al. [16], we show that a sufficiently good online algorithm for this fractional 
allocation problem can be used as a building block to obtain a good fractional solution to 
the /c-server problem on an HST. Finally, by proving that such a fractional fe-server solution 
can be rounded in an online randomized manner, while losing only an 0(1) factor in the 
competitive ratio, we get a reduction of the fc-server problem to our version of the fractional 
allocation problem. 

An interesting feature of this reduction is that our fractional relaxation is too weak to give 
anything better than an 0{k) guarantee for the (integral) allocation problem, since there are 
instances on which any integral solution must pay ^}{k) times the fractional cost. Therefore, 
it is somewhat surprising that even though our relaxation is unable to provide any reasonable 
algorithm for the (integral) allocation problem, it suffices to give a good guarantee for the 
(integral) A;-server problem. 

2. As the next step, we design an online algorithm for the fractional allocation problem with the 
refined guarantees required in the above reduction. Our techniques here are inspired by the 
ideas developed recently in the context of the caching with costs problem [3] and weighted 
paging [2]. However, while these previous algorithms were designed and described using the 
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online primal-dual framework, our algorithm is combinatorial. To analyze the performance 
we employ a novel potential function approach. 

By plugging the algorithm for the fractional allocation problem into the above reduction, we 
get a (roughly) 0(£log(A:£))-competitive algorithm for the k-seiver problem on an HST of 
depth i, provided that the HST is sufficiently well-separated. 

3. Finally, we note that the competitive guarantee provided by the above A;-server algorithm 
depends on the depth i of the HST we are working with and, as i can be i7(logA), this 
guarantee can be polylogarithmic in A. Therefore, as A can be 2*^('^), this would lead to 
competitiveness that is even polynomial in n.^ To deal with this issue, we define a weighted 
version of an HST in which the edge lengths on any root-to-leaf path still decrease at (at 
least) an exponential rate, but the lengths of the edges from a given node to its children could 
be non-uniform. We prove that any HST can be transformed to a weighted HST of depth 
i = O(logn) while incurring only an 0(1) distortion in leaf-to-leaf distances. We then show 
that our previous ideas can be applied to weighted HSTs as well. In particular, our online 
fractional allocation algorithm is actually developed for a weighted star metric (instead of a 
uniform one), and, as we show, it can be employed in our reduction to obtain a fractional k- 
server algorithm on a weighted HST. The fractional fc-server algorithm can again be rounded 
to a randomized algorithm with only an 0(1) factor loss. Since i is now O(logn) and thus 
does not depend on A, it gives us an overall guarantee which is polylogarithmic only in n and 
k. 

In Section 2 we describe the above ideas more formally and also give an overview of the paper. 
1.2 Preliminaries 

We provide definitions and concepts that will be needed in the paper. 

Hierarchically well-separated trees. Hierarchical well-separated trees (HST-s), introduced 
by Bartal [5, 6], is a metric embedding technique in which a general metric is embedded into a 
probability distribution defined over a set of structured trees (the HST-s). The points of the metric 
are mapped onto the leaves of the HST, while internal tree nodes represent clusters. The distances 
along a root-leaf path form a geometric sequence, and this factor is called the stretch of the HST. 
An HST with stretch a is called a a-HST. The embedding guarantees that the distance between 
any pair of vertices in the metric can only increase in an HST, and the expected blowup of each 
distance, known as the distortion, is bounded. It is well known that any metric on n points can 
be embedded into a distribution over c-HSTs with distortion 0{a\og^n) [18]. This approach of 
embedding into HSTs is particularly useful for problems (both offline and online) which seem hard 
on a general metric, but can be solved fairly easily on trees (or HSTs). 

Due to the special structure of HSTs, the task of solving problems on them can sometimes be 
reduced to the task of solving a more general (and thus harder) problem, but on a uniform metric. 
For example, this approach was used to obtain the first polylogarithmic guarantees for the metrical 
task systems problem (MTS) by [7] (later further refined by [21]). More precisely, Blum et al. [7] 
defined a refined version of MTS on a uniform metric known as unfair-MTS and showed how an 

^To see an example when this is the case, one could consider a metric space corresponding to n points on a line 
that are spaced at geometrically increasing distances. 
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algorithm with a certain refined guarantee for it can be used recursively to obtain an algorithm for 
MTS on an HST. This approach is especially appealing in the context of the A:-server problem, as 
this problem on a uniform metric (i.e. paging) is well- understood. This motivated Cote et al. [16] 
to define a problem on a uniform metric, that we call the allocation problem, and show how a good 
algorithm for it can be used to design good /c-server algorithms on HSTs. This problem is defined 
as follows. 

The allocation problem. Suppose that a metric on d points is defined by a weighted star in 
which the distance from the center to each point i, 1 < i < d, is Wi.^ At time step t, the total 
number of available servers, K{t) < k, is specified, and we call the vector k, = (k(1), k{2), . . .) the 
quota pattern. A request arrives at a point i* and it is specified by a (/e + l)-dimensional vector 
/i* = (/i*(0), . . . ,h^{k)), where denotes the cost of serving the request using j servers. 

The cost vectors at any time are guaranteed to satisfy the following monotonicity property: for 
any < j < A; — 1, the costs satisfy > + 1). That is, serving a request with more servers 
cannot increase the cost. Upon receiving a request, the algorithm may choose to move additional 
servers from other locations to the requested point and then serve it. The cost is divided into two 
parts. The movement cost incurred for moving the servers, and the hit cost determined by the cost 
vector and the number of servers at location z*. 

In this paper, we will be interested in designing algorithms for (a fractional version of) this 
problem that provide a certain refined competitive guarantee. Namely, we say that an online 
algorithm for the allocation problem is {6, j)- competitive if it incurs: 

• a hit cost of at most 6 ■ (Optcost + A • g{K)); 

• a movement cost of at most 7 • (Optcost + A • g{K)), 

where Optcost is the total cost (i.e., hit cost plus movement cost) of an optimal solution to a given 
instance of the allocation problem, g{K) := l'^(i) — ^{t — 1)| is the total variation of the server 
quota pattern, and A is the diameter of the underlying metric space. 

From allocation to /c-server. Cote et al. [16] showed that a (1 + e, /3)-competitive online algo- 
rithm for the allocation problem on d points - provided e is small enough and /3 = Oe(polylog((i, k)) 
- can be used to obtain a polylogarithmic competitive algorithm for the /c-server problem on general 
metrics. In particular, the next theorem follows from their work and is stated explicitly in [3]. 

Theorem 2. Suppose there is a {1 + e, /3)- competitive algorithm for the allocation problem on a 
uniform metric on d points. Let H be an a-HST with depth i. Then, for any e < 1, there is an 
0{/3'y^~^^ /{'J — 1))- competitive algorithm for the k-server problem on H, where 



Setting e = l/l, this gives an O {(5 1)- competitive algorithm on a-HSTs, provided the HST separation 
parameter a is at least 

■^Evcn though Cote et al. [16] considered the aUocation problem on a uniform metric, we find it useful to work 
with the more general weighted-star metric version of this problem. 
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At a high level, the /c-server algorithm in Theorem 2 is obtained as follows. Each internal 
node p in the HST runs an instance of the allocation problem on the uniform metric formed by its 
children. In this instance, the cost vectors appearing at a child i are guided by the evolution of 
the cost of the optimal solution to the instance of the /c-server problem restricted to the leaves of 
the subtree that is rooted at i. Furthermore, the quota patterns for each of the allocation problem 
instances is determined recursively. The root of the tree has a fixed server quota of k, and the quota 
corresponding to a non-root node i is specified by the number of servers that are allocated to i by 
the instance of the allocation problem run at the parent of i. The distribution of the servers on the 
leaves of the tree is determined in this manner, thus leading to a solution to the /c-server problem. 
The overall guarantee in Theorem 2 follows roughly by showing that the hit cost guarantee of (l+e) 
multiplies at each level of the recursion, while the movement cost guarantee of f3 adds up. 

Weighted HSTs. Note that the guarantee in Theorem 2 depends on i, the depth of the cr-HST, 
which in general is @{log^ A). To avoid the resulting dependence on the diameter A that can be as 
large as 2^^"), we introduce the notion of a weighted cr-HST. A weighted a-HST is a tree having the 
property that for any node p, which is not the root or a leaf, the distance from p to its parent is at 
least a times the distance from p to any of its children. Thus, unlike an HST, distances from p to 
its children can be non-uniform. The crucial property of weighted HSTs that we will show later is 
that any cr-HST T with 0(n) nodes can be embedded into a weighted cr-HST with depth O(logn), 
such that the distance between any pair of leaves of T is distorted by a factor of at most 2a/{a — 1) 
(which is 0(1) if, say, a > 2). Reducing the depth from 0(log A) to O(logn) allows us to replace 
the factor of log A by log n in the bound on the competitive factor we get for the /c-server problem. 

Fractional view of randomized algorithms. The relation between randomized algorithms 
and their corresponding fractional views is an important theme in our paper. By definition, a 
randomized algorithm is completely specified by the probability distribution over the configura- 
tions (deterministic states) at each time step of the algorithm. However, working explicitly with 
such distributions is usually very cumbersome and complex, and it is often simpler to work with 
a fractional view of it. In a fractional view, the algorithm only keeps track of the marginal dis- 
tributions on certain quantities, and specifies how these marginals evolve with time. Note that 
there are multiple ways to define a fractional view (depending on which marginals are tracked). 
For example, for the /c-server problem on an HST, the fractional view might simply correspond to 
specifying the probability pi of having a server at leaf i (instead of specifying the entire distribution 
on the fc-tuples of possible server locations). Clearly, the fractional view is a lossy representation of 
the actual randomized algorithm. However, in many cases (though not always), a fractional view 
can be converted back to a randomized algorithm with only a small loss. We now describe the 
fractional views we employ for the two main problems considered in this paper. 

Fractional view of the /c-server problem on an HST. Let T be a cr-HST. For a node j G T, 
let T{j) be the set of leaves in the subtree rooted at j. In the fractional view, at each time step t, 
the probability of having a server at leaf i, denoted by p-, is specified. Upon getting a request at 
leaf i at time t, a fractional algorithm must ensure that = 1. Let the expected number of servers 
at time t at leaves of T(j) be denoted by k*{j) = J2ieT{j)Pi- Clearly, the movement cost incurred 
at time t is YljeT ~ f^*'~^U)\j where W{j) is the distance from j to its parent in T. 
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It is easy to verify that the cost incurred by any randomized algorithm is at least as large as 
the cost incurred by its induced fractional view. Conversely, it turns out that the fractional view is 
not too lossy for a cj-HST (provided o" > 5). In particular, in Section 5.2 we show that for a cr-HST 
(cj > 5), an online algorithm for the fc-server problem in the fractional view above can be converted 
into an online randomized algorithm, while losing only an 0(1) factor in the competitive ratio. 



The fractional allocation problem. For the allocation problem we consider the following frac- 
tional view. For each location i E [d], and all possible number of servers j € {0, . . . , k}, there is a 
variable ■ denoting the probability of having exactly j servers at location i at time t. For each 
time t, the variables x\ ^ must satisfy the following constraints. 

1. For each location i, the variables x\ ■ specify a probability distribution, i.e., 'Ylij^\j — ^ ^^^^ 
each x\ ■ is non-negative. 

2. The number of servers used is at most K(t), the number of available servers. That is. 



At time step t, when cost vector /i* arrives at location z*, and possibly K(t) changes, the algorithm 
can change its distribution from x^~^ to incurring a hit cost of h^{j)xjt j- The movement 
cost incurred is defined to be 



E ^h' 

j'<j 



E 

j'<j 



(1) 



Remark: Note that when our configurations are integral, this quantity is exactly the cost of 
moving the servers from configuration x*"^ to configuration x*. In the fractional case, each term 
in the outermost sum can be seen as equal to the earthmover distance between the probability 
vectors {xI~q^ , . . . , x*"^^) and (x* q, . . . , x- ^) with respect to a linear metric defined on {0, 1, ... , k}. 
The earthmover distance is the optimal solution to a transportation problem in which x*~^ is the 
supply vector, x* is the demand vector, and the cost of sending one unit of flow between x*~^ and 
x*y is Wi ■ \j — since |j — j'\ is the change in number of servers resulting from sending this 
unit of flow. It is not hard to see"^ that in the case of a linear metric, the optimal solution to the 
transportation problem (up to a factor of 2) is captured by (1). 



A gap instance for the fractional allocation problem. As mentioned earlier, unlike the 
fractional view of the A;-server problem presented above, the fractional view of the allocation problem 
turns out to be too weak to yield a randomized algorithm for its integral counterpart. We thus 
present an instance of the allocation problem for which the ratio between the cost of any integral 
solution and the cost of an optimal fractional solution is 0(/c). However, we stress that even though 
the fractional view fails to approximate the integral allocation problem, we are still able to use it 
to design a fractional (and, in turn, integral) solution to the fe-server problem. In particular, we 

^Using uncrossing arguments on the optimal transportation solution. 
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show in Section 4 that Theorem 2 holds even when we substitute the randomized algorithm for the 
allocation problem with the fractional algorithm. 

Let us consider a uniform metric space over d = 2 points, and consider an instance of the 
allocation problem in which exactly K{t) = k servers are available at each time. Furthermore, at 
each odd time step 1, 3, 5, . . ., the cost vector h = (1, 1, 1, 0) arrives at location 1, and at each 
even time step 2,4,6, . . ., the vector h' = (1,0,0, ...,0) arrives at location 2. 

We show that any integral solution to this instance of the allocation problem must incur a high 
cost, while there is an Q{k) times cheaper solution in the fractional view. 

Claim 3. Any solution to the instance above incurs a cost of Vt{T) over T time steps. 

Proof. Observe that the hit cost can be avoided at location 1 only if it contains k servers, and it 
can be avoided at location 2 only if it contains at least one server. Thus, any algorithm that does 
not pay a hit cost of at least 1 during any two consecutive time steps, must move at least one 
server between locations 1 and 2, incurring a movement cost of at least 1, concluding that the cost 
is n{T). □ 

Claim 4. There is a solution in the fractional view of cost 0{T/k) over T time steps. 

Proof. Consider the following solution in the fractional view. At each time step t, let: 

— — , Xii^ — 1 — — , and 3:21 — 1. 

Note that this solution satisfies all the constraints in the fractional view. Since location 2 always has 
a server, it never pays any hit cost. Moreover, location 1 has fewer than k servers with probability 
1/A;, it thus incurs only a hit cost 1 • xi^o = 1/^ a-t every odd time step. Also, as the solution does 
not change over time, the movement cost is 0. □ 



2 Overview of Our Approach 

In this section we give a formal description of our results, outline how they are organized, and 
discuss how they fit together so as to obtain our main result. 



Fractional allocation algorithm. In Section 3 we consider the fractional allocation problem 
on a weighted star, and prove the following theorem. 

Theorem 5. For any e > 0, there exists a fractional (1 + e,0{\og{k/ e)))- competitive allocation 
algorithm on a weighted star metric. 



From allocation to A;-server problem. In Section 4 we show how the algorithm from Theorem 
5 can be used to obtain a fractional fc-server algorithm on a sufficiently well-separated weighted 
HST. In particular, we show that: 

Theorem 6. Let T he a weighted a-HST of depth i. If, for any < e < 1, there exists a 
{l + e,log{k/e))- competitive algorithm for the fractional allocation problem on a weighted star, then 
there is an 0{ilog{ki))- competitive algorithm for the fractional k-server problem on T, provided 
a = n{nog{k£)). 
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Putting it all together. We now show how to use Theorems 5 and 6 to prove our A;-server 
guarantee for general metrics, i.e., to prove Theorem 1. 

To this end, we need two more results that we prove in Section 5. First, 

Theorem 7. Let T be a a-HST with a > 5. Then any online fractional k-server algorithm on 
T can be converted into a randomized k-server algorithm on T with an 0(1) factor loss in the 
competitive ratio. 

Note that the above result gives a rounding procedure only for HSTs (and not weighted HSTs). 
To relate HSTs to weighted HSTs, we show the following. 

Theorem 8. Let T be a a-HST with n leaves, but possibly arbitrary depth. Then T can be trans- 
formed into a weighted a-HST T such that: T has depth O(logn), the leaves of T and T are 
identical, and any leaf to leaf distance in T is distorted by a factor of at most 2a /{a — 1) in T. 

Given the above results, we can present the proof of our main theorem. 

Proof of Theorem 1. Our algorithm proceeds as follows. First, we use the standard technique 
[18] to embed the input (arbitrary) metric M into a distribution /i over cr-HSTs with stretch 
a = B(lognlog(A;logn)). This incurs a distortion of 0(cr log^n) and the resulting HSTs have 
depth 0(logg. A), where A is the diameter of M. 

Next, we pick a random HST T according to the distribution /x, and transform T into T using 
Theorem 8. As T has depth (. = O(logn), it holds that a = Q{t\og{kt)) and hence applying 
Theorem 6 to T gives an 0{l\og{kt)) = 0(lognlog(A:log?i))-competitive fractional fc-server algo- 
rithm on T. Since the leaves of T and T are identical, and the distances only have 0(1) distortion, 
the fractional fc-server solution on T induces an 0(log n log(A; log n))-competitive fractional /c-server 
solution on T. By Theorem 7, this gives an 0(log n log(A; log n))-competitive randomized A;-server 
algorithm on T. 

We now relate the optimum fc-server cost on M to the optimum on T. Let Opt^^ denote the 
optimum /c-server solution on Af , and let ct denote the cost of this solution on T. Since the 
expected distortion of distances in our ensemble of HSTs is small, we have: 

¥.^[cT] = 0{a\og,n)-Ovi*M. (2) 

Let AXgrp denote the cost of the solution produced by the online algorithm on T, and let Algjyj 
denote the cost of this solution on the metric M. As the pairwise distances in T are at least the 
distances in M, Alg^;.^ < Alg^. Also, as Alg^. is 0(log n log(A; log n))-competitive, it follows that: 

Alg^/ < Algj- = 0(lognlog(A;logn)) • < 0(log n log(/c log n)) • ct 

where is the optimum /c-server cost on T (and hence < ct). Taking expectation with respect 
to fjL above and using (2), the expected cost of our solution E^[Alg^./'] satisfies: 

IE^»[AlgM] = 0(lognlog(A;log?i)) •E^[ct] = O(o-log^n) • 0(log ?i log(A; log n)) • Opt^^, 

which implies that the overall algorithm has a competitive ratio of 

/ (\ogn\\ /log^n(log(/clogn))2\ ^ 2 , , 3 , , \ 

O \ a { I ] ■ O (log n log(/c log nj) = O — = O (log k log n log log n) . 

V VogaJJ V log log n J 

□ 



9 



3 The Fractional Allocation Problem 



Consider a metric corresponding to a weighted star on d leaves (also called locations) 1, . . . ,d, where 
Wi is the distance from leaf i to the root. Let us fix a sequence of cost vectors h^,h^,. . . and a 
server quota pattern k = (k;(1),k(2), . . .), where K{t) is the number of servers available at time t, 
and K{t) < k for all times t. 

Recall that in the fractional allocation problem the state at each time t is described by non- 
negative variables x- ^ denoting the probability that there are exactly j servers at location i. At 
each time t, the variables x^j satisfy: (1) ^Ij — 1> each i; (2) YliJ2j J^ij — ^(*)- 

As we shall see, when describing and analyzing our algorithm for the fractional allocation 
problem, it will be easier to work with variables y* ^, defined as 

yli = Z]4i'' for i G {!,..., d}, j e{l,2,...,k + l}. 

j'=0 

I.e., ul j is the probability that at time t we have less than j servers at location i. Clearly, for every 
i, as long as: 

yh e [0,1] (3) 
yij-i < yij, y^,k+l = 1, V? g {l, . . . , d}, j e{2,...,k + l}, (4) 

there is always a unique setting of the variables a;* ^s that corresponds to the yjjS. Therefore, in 
what follows we make sure that the variables yjjS generated by our algorithm satisfy the above two 
conditions. 

The condition that at most K,{t) servers are available at each time t can be expressed in terms 
of yj j as: 



d k d k 

i=l j=l i=l j=0 i=l j=0 i=l j=0 

(d k \ 
EEK. >kd-K{t). (5) 
i=l j=0 / 

Let us now focus on a particular cost vector /i* = (/i*(0), /i*(l), . . . ,h^{k)) corresponding to time 
step t. Recall that is the hit cost incurred when serving the request using exactly j servers. 

We can express /i* as 

■ h\j-l)-h'{j) j = l,2,...,k 
'"^ \ h\k) j = k + l 

The variables A*- are non-negative as the hit costs are non-increasing in j, i.e., /i*(0) > /i*(l) > 
. . . > h^{k). Intuitively, A*- captures the marginal cost of serving the request with strictly less than 
j servers.^ The hit cost incurred by a configuration = now has a simple formulation. 



A 



^We note that we can assume that A^+i is always 0. Otherwise, as any vahd algorithm (including the optimal one) 
always has at most k servers at a given location, any competitive analysis established for the case A^_,_j^ = = 

carries over to the general case. Thus, from now on we remove Xl-^i and also yt^k+i (that is always 1) from our 
considerations and notation. 
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Let denote the location on which the hit cost vector /i* appears, then the hit cost X^j=o ^^OO^i* j 
can be expressed as 

Similarly, expression (1) for the movement cost from a configuration to a configuration y* 
becomes 

d / , 

.t „.i-l 



j=i \i=i 



3.1 Description of the Algorithm 

In light of the above discussion it suffices to specify how state {yl^^}i,j evolves to {yljjij at time t 
upon arrival of cost vector /i* and server quota K{t). Our algorithm performs this evolution in two 
stages. First, it executes a fix stage in which the number of servers is decreased so as to comply 
with a decrease of the quota K{t). Then, it proceeds with a hit stage, during which the (fractional) 
configuration of the servers is modified to react to the cost vector /i*. We describe the dynamics 
of both stages as a continuous process governed by a set of differential equations. As it turns out, 
viewing the evolution of the server configuration this way allows us to both simplify the description 
and the analysis of the algorithm. The evolution of the fractional solution during the fix stage is 
parametrized by a time index r that starts at and grows until the number of servers is no more 
than K{t). The hit stage is parametrized by a time index r] that starts initially at and ends at 1. 

For the sake of simplicity, let us drop the index t from our notation since it does not play any 
role in our analysis. We denote the configuration at time t — 1 hy and the configuration at time 
t hy y^. Let A denote the hit cost vector A* and let i denote the location that A* penalizes. The 
intermediate states of the algorithm are denoted by y"^, r > 0, during the fix stage, and by y^, 
rj G [0, 1], during the hit stage. At each time rj € [0, 1] (respectively, r > 0), the algorithm specifies 

dyV dy'!' 

the derivative of each variable y^j (respectively, of each yjj ) . Denote by Tg the final value 
that T reaches during the fix stage. Eventually, each yjj is defined as follows. 

'•^ Jr=o or 7^=0 dri 

An important issue that needs to be addressed is proving that the differential equations speci- 
fying the derivatives at each step have a (unique) solution and thus the algorithm is well-defined. 
This proof turns out to be non-trivial in the case of the hit stage, since the derivative during this 
stage might change in a non-continuous manner. Nevertheless, as we will show, the process is still 
well-defined. 

Another technical detail is that during the hit stage, in intermediate times r; € [0, 1], we will 
not work with the hit cost vector A directly, but rather with a modified cost vector A'' that can 
vary with i]. (During the first reading, the reader may assume that A'' = A and skip the part below 
about blocks and go directly to the description of the fractional algorithm.) 

We initialize A'^ = A. To define A^ for ?] > 0, we need the notion of blocks. 



Vh = Vt' + + ^dry. (6) 
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Blocks: During the hit stage, for each rj G [0,1], we maintain a partition of the index set 
{1, . . . ,k + 1} (for location i) into blocks B^, B2 ■ ■ ■ , Bj. The collection of blocks is denoted by 
and it satisfies the following properties. 

1. ■ is identical for all indices j within any block B € V^. For future reference, let us denote 
by y^{B) this common value for all j € B. 

2. For any block B = {j, . . . , j + s — 1} of length s in V^, it holds that for every 1 < r < s, 

j'=3 j'=j 

That is, the average value of the A's in any prefix of a block is no more than the average of 
the entire block. 

We define A'' to be the cost vector obtained by averaging A over the blocks in V^. That is, for 
each B € V^, we set X{B) = {J2jeB and then 

_ r X{B) iii=l and j G B, for B € V'^, 
I otherwise. 

Now, in our algorithm, the initial partitioning of blocks is the trivial one, i.e., one in which 
each index j forms its own block. (Note that in this case we indeed have A'^ = A.) Next, blocks are 
updated as r] increases. For any r? > 0, if two consecutive blocks Bp, i?p+i G satisfy: 

y^{Bp)=y^{Bp+i) and X{Bp) < X{Bp+i), (8) 

then Bp and -Bp+i are merged and D"^ is modified accordingly. Note that the condition X{Bp) < 
X{Bp^i) guarantees that (7) is satisfied in the new block created by merging Bp and -Bp+i. As we 
shall see later (Lemma 11), a crucial property of the evolution of P'' during the hit stage is that 
Uj are updated in a way that guarantees that a block never splits once it is formed. 

The algorithm. We are now ready to state our algorithm. It is parameterized by a parameter 
e > that will be fixed later. 
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Fractional Allocation Algorithm: 

Set /3 = ^, a = ln(l + i) = ln(l + 
Fix stage: Set = y*~^ ■ 

For any r G [0, oo), while '^Zij vlj < kd — K{t) (i.e., while the total volume of servers exceeds 
the quota) we increase each variable yjj at a rate: 

dr \ yl^ = 1 

Denote by Tg the termination time of the fix stage. 

Hit stage: Set y^ to be the state obtained at the end of the fix stage*^. Define the following 
update rule for any t] G [0, 1]: 

• If Vij = kd - Hi{t), choose N{r]) > such that^ 



if either i^N{r]) - aXljj > and ylj = 1, 

or (^N{r]) - aXl^ < and yl - = (9) 
^ (yh + 1^) ■ (^('?) - "^m) otherwise 



dr] 



and 

• Otherwise (i.e., if y^^- > kd — K-it)), set A^(r/) = 0, and define the derivatives of the 
variables as above. 



Output: For each return y*^. ^ y*/ + ^dr + Xj^^ ^dr? 



"Note that upon termination of the fix stage, ^ yj"^ > kd — n{t). 

'As we show in Lemma fO, there is always a way of choosing N{ri) such that the desired conditions are 
satisfied. 



High-level intuition. Before proving correctness and analyzing the performance of the above 
algorithm, we provide some intuition on the dynamics underlying it. 

Dynamics of the fix stage: This is fairly straightforward. The algorithm simply increases all 
the variables yl ^ that are strictly less than 1 (which decreases the total number of servers), until 
the quota K{t) on the number of servers is met. We note that it may also be the case that the 
total number of servers used is already strictly smaller than the quota K{t) to begin with, e.g., if 
the server quota increases at time t. In this case, nothing is done during the fix stage. Notice that 
the rate of change of a variable yj ^ is proportional to its value, which means that the change is 
governed by an exponential function. This kind of update rule is in line with previous algorithms 
for weighted paging [2, 3]. 
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Dynamics of the hit stage: For simplicity, let us assume that during this stage we have that 
< y^j < 1, for all {i,j), and each y^j is a strictly increasing function of j. That is, 

o<y.':i<y.':2<---<y.'!.<i. (lo) 

for all locations i and r] € [0, 1]. 

Note that under this assumption condition (8) will never trigger. As a result, no blocks are 
merged and we have = A for all rj S [0, 1]. Furthermore, as in this case each variable y^j is 
strictly between and 1, its rate of change during the hit stage simplifies to: 



with 



^J:^,jy^,j>kd-^(t), 

^ My^^i+f^) ' ' otherwise (i.e. if E»j vlj = kd - K{t)). 



(12) 



(Note that the value of N{r]) in the second case of (12) is determined by the fact that 



drj 

has to be zero.) 

Let us study the dynamics given by (11) more carefully. Recall that Ajj- = for i ^ i. First, if 
the number of servers used is below the quota, i.e. if Yli j Vij > kd — K{t), the algorithm responds 
to the cost vector A by simply increasing the number of servers at location i by decreasing each y^ 

at a rate of :^{y^ j + P) ■ c^-^f To understand this better, it is instructive to consider the special 

case when Xjj = 1 for some particular index j and is otherwise (this corresponds to the hit cost 

vector h"^ that incurs cost 1 if there are strictly fewer than j servers at i and cost otherwise) . In 
this case, the algorithm reduces y^j and keeps other y^^'s unchanged (in particular yjj_^_i and y^j ^ 
remain unchanged). As y^^ and y^^ do not change while y^:- decreases, this has the effect of 
increasing the probability mass Xjj = Uj-^j^^ ~ Uj-j on {i,j), and decreasing the probability mass 
= Vjj ~ y~ij-i o^ {hj ~ 1)^ Moreover, note that the decrease in is exactly equal to the 

increase in x^^. 

Now, let us consider the case when the number of servers used is exactly equal to the quota. 
Here, we also need to ensure that the quota is maintained. This is done by offsetting the increase 
in the number of servers at location i (as described by the dynamics in the previous paragraph), 
by decreasing the number of servers at all locations (including i). This is precisely the purpose 
of the term N{ri) in (11). It increases y^ ^ (and hence decreases the number of servers) at a rate 
proportional to -^{yi j + /?) (as in the fix stage). Note that as Ajj- = for i 7^ i, this update can 
only decrease the number of servers at locations i ^ i. The overall number of servers at location i 
can only increase, but of course due to the redistribution of probability mass at i, it may happen 
that the probability mass at some (i, j) goes down. 

Unfortunately, when assumption (10) does not hold, the simple dynamics described above may 
produce infeasible configurations. First, increasing or decreasing variables according to (11) does 



^In the simplified discussion here we are implicitly assuming that > by assuming that > J/^-j_j- 
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not take into account that the variables need to stay in the range [0, 1], and hence this may be 
violated. This happens if (i) a variable is equal to and has a negative derivative, or (ii) when 
it is equal to 1 and has a positive derivative. To avoid this problem we need to deactivate such 
variables (by setting their derivative to be 0) when either one of these two cases occurs. Moreover, 
the above dynamics may also violate the monotonicity condition (4). To avoid this issue, we need 
to merge blocks and modify A'' accordingly, as was previously described. 

Now, the resulting algorithm does not produce infeasible configurations anymore. However, its 
dynamics is somewhat more involved. Before we discuss it, let us first provide a formal definition 
of an inactive coordinate or variable. 

Definition 9. During the fix stage, a coordinate {i,j) for which yjj < 1 is said to be active at time 
T. Otherwise it is said to he inactive. During the hit stage, coordinate {i,j) (or variable ■) is said 

to he inactive at time t] G [0, 1] if either (^N{i]) — aX^ > and y^j = 1, or (^N{rj) — ctA^^^ < 

and y^j = 0. Otherwise, coordinate {i,j) is said to he active. Denote by (respectively ) the 
set of active coordinates at time r] (respectively t). 

Now, by definition, during the hit stage at time rj only the active variables might change. So, 
we can compactly rewrite the evolution of the variables during the hit stage given in (9) as 

dyl^ _ / V . . . ifii,j)^Av, 

otherwise 

Furthermore, as we still require that = Yj(i,j)(^A^ "if = 0' if Y.^jVlj > kd - K{t), we 



\ ^ ( yh + /3 ) • ( Niri) - aXl ) otherwise 

still require tha 
have that N{r]) can be expressed as 



^^^E^,Jyl>kd-Kit), 



In light of the above, one can see that the simple evolution of the variables in the special case 
of (10), as described by (11), is a special case of the general evolution in which all coordinates 
are being active and A'' = A for all i] G [0, 1]. The reason why the analysis of the general process 
is more complicated is that the set of active coordinates (and the hit cost A'') can, in principle, 
change very abruptly between two values of r/. Moreover, as stated, equation (14) and Definition 
9, have a circular dependency. In particular, the value of N{ri) depends on the set , but in turn 
the definition of A^ also depends on the value of N{rj). As a result, a priori it is not even clear that 
our algorithm is well defined. That is, a unique trajectory consistent with our local evolutionary 
rules indeed exists. We proceed to proving this now. 



Well-definiteness of the algorithm. We start by addressing the above-mentioned issue of the 
circular dependency between the value of N{rj) and the set A'^. Note that it is not clear any more 
that there always exists a non-negative normalization factor N{r]) as required by our algorithm. 
As we prove in the next lemma, however, one can use a simple continuity argument to prove the 
existence of the desired normalization factor. 

dy'^ ■ dy'^ ■ 

Lemma 10. There exists a N{r]) > for which = 0, where the derivatives are as 

defined in the algorithm. Moreover, the set A^ of active coordinates is never empty. 
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Proof. Fix any rj G [0, 1]. Let us consider the function f{s) = "Yli ji^yl j/ dri)\^(^^-^^g, i.e., /(s) is the 
sum of all derivatives given by equation (9) for the case when N{ri) is equal to s. 

Clearly, if s = N{'q) = 0, then {dy^ j/di])\^(^^^^g < for each and hence f{s) < 0. If 

/(O) = then N{ri) satisfies the requirements. Note that in this case all coordinates that are 
non-zero are active. This set is non-empty as at the beginning of the hit stage the sum over all 
coordinates is at least kd — K{t) > 0. 

Thus, suppose that /(O) < 0. Let Amax = niaxjj X^-j be the largest entry in A''. Then, at 
s = N{r]) = aAmax we have (dy^ j/dr])\j\f(^^-^^s > for each (i, j) and hence f{s) > 0. 

Next, we claim that each derivative {dy^ j/ dr])\^[j^-^=s is a continuous function of s. To this end, 
note that if < • < 1, then the function {dy^ j/dr])\j\r(^^)^g is a linear function of s (and thus is 
continuous). For y'^j = 0, the function {dy'^ ■ / dr])\p^(jf^^g is zero for s < aXlj, and then increases 

linearly for s > aX^j - so, again, it is continuous. Similarly, for y^j = 1, (-^)|^(^)=^ is negative 
if s < ctX^-j, and increases linearly until s = a ■ X^j, and then remains zero. 

Now, as each derivative is continuous, so is /. Thus, we know that by the intermediate value the- 
orem the preimage /~^(0) in the interval [0, aAmax] is non-empty. Furthermore, as / is continuous 
and /(s) < for s = 0, there exists a minimal s* such that f{s*) = and < s* < aAmax- 

We take N{i]) = s* and claim that the corresponding set is non-empty (which would prove 
the lemma). To see why it is the case, note that if there exists a coordinate < y^j < 1 then it 
is always active and we are done. Otherwise, let us consider Aq (Ai) to be the set of (i, j) with 
y^_j = (resp. with y^j = 1). As the sum over all coordinates is at least kd — K{t) > 0, the set Ai is 
non-empty. Suppose for s* > all the coordinates in Aq,Ai are inactive, then by definition for all 
(i,j) G Ai, s* > aA^j (note the strict inequality), and for all coordinates in Aq, s* < aX^j. This is 
a contradiction to the minimality of s* as we could find a < s' < s*, such that f{s') = 0. □ 

Now, as our algorithm is defined via a set of differential equations indexed by r and r], to 
prove that it is well-defined we need to show that there exists a unique solution to this set, and 
furthermore this solution is feasible for the allocation problem. To this end, we prove the following 
lemma, whose proof is in Appendix A. 

Lemma 11. There exists a unique solution y'^ and y"^ , defined on the intervals r > 0, 77 G [0, 1], 
to the set of differential equations defined by the algorithm. Furthermore, the solution satisfies the 
following properties: 

• Boundary: For each {i,j), and for all < r, and < 77 < 1; < UijiUij ^ 1- 

• Monotonicity: For each {i,j), (j < k), y^ - < y^j^^ and yjj < yj^+i- 

• Quota: The expected number (volume) of servers at the end of the fix stage and at any 
Tj G [0, 1] does not exceed nit). That is, y"^ j > kd — K{t) for all rj G [0, 1]. 

• Blocks: During the hit stage. Blocks can only merge (and they never split). 

• Discontinuity: The total number of times rj G [0,1] that each location {i,j) changes its 
status from active to inactive, as well as the number of discontinuity points of N{r]) as a 
function ofij, is finite (in fact, polynomial in k and d). 
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3.2 Cost Accounting 



In this section we prove some helpful properties that allow us to charge the algorithm and the 
optimal solution in a continuous fashion. This will simplify the potential function based analysis 
that we later perform. First, we deal with the charging of the hit cost, and then with the accounting 
of the movement cost. 

Charging the hit cost. The issue we want to address here is that at a given time t the hit costs 
of the optimal solution and our algorithm depend only on the final states of both solutions. More 
precisely, if y* is the optimal solution at time t, and y = y^ is the final state of the algorithm at 
time t, then the hit cost of the optimal solution (respectively, of the algorithm) at time t is equal to 
A • y* (respectively, X-y). However, as our algorithm is described in a continuous fashion, it would 
be simpler to also have a way of accounting for the hit costs in a continuous and local fashion. 
In particular, we would like to account for the hit cost of the optimal solution as: 

r A^-y^-dr/, (15) 

Jrj=0 

and for the hit cost of the algorithm as: 

X'^-y'^dT]. (16) 

Jri=0 

Note that the above expressions can be interpreted as charging locally at every time rj € [0,1] an 
infinitesimally small hit cost of X^ ■y*drj (respectively, X^ -y^drj) to the optimal solution (respectively, 
to the algorithm). Now, to make this accounting valid, we need to show that the above expressions 
can only overestimate the hit cost of our algorithm and underestimate the hit cost of the optimal 
solution. We prove that this is indeed the case in the following lemma. 

Lemma 12. The following inequalities hold: 

t X^-y*-dr] < X-y*, (17) 

Jti=0 



t X^-y^'dv > X-y. (18) 

Jri=0 

Proof. We first prove inequality (17). To this end, we show that for any non-decreasing vector 
V = {vi,V2, . . and any < 771 < r/2 < 1 j it holds that 

X^' - vy X"^^ ■ V. (19) 

Note that as A = A^ and y* is feasible (and thus satisfies property (4)), taking v equal to y* 
immediately gives Inequality (17). 

By Lemma 11, we know that the only difference between A^^ and A^^ is that some of the 
blocks in T>^^ can be merged in D^'^. Therefore, it suffices to show that whenever two consecutive 
blocks Bi and B2 merge to form another block B, it holds that X{Bi) X^^g^^ Vi + A(i?2) SieSj ^* — 
X{B) ^jg^ Vi for any hit cost vector A. 
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Let ^1 = \Bi\,£2 = \B2\, and let ai = (Y^i^Si "2 = (I]ieB2 ^j)/^2- Then, by the definition 

of X{B), the inequahty above is equivalent to showing that 

XiB,)hai + XiB2)i2a2 > ( ^^^'^^| ^ ^™^' ) + «2^2). (20) 

As V is increasing we have ai < 02, and since Bi and B2 were merged, by (8) it must be that 
X(Bi) < A(i?2)- A direct calculation shows that (20) holds under these conditions. 

Now, to prove that inequality (18) also holds, we prove that whenever the derivative of • y"^ 
is defined, i.e., whenever neither nor change (which is the case except for possibly finitely 
many points, cf. Lemma 11) we have that 

< 0^ (21) 

dr) 

That is, the state y'^ evolves in a way that reduces the hit cost of the algorithm with respect to the 
corresponding hit cost vector. 

To see how (18) follows from (21) we first note that (21) implies that 

X^-y^>\'-y\ 

for any i] G [0, 1]. Now, we have for any block B G T)^^ y} . = y^{B) for all j G and thus 

^'■y'=T. T.^A.= E y'(B)E^,= E yHB)x\Bm = x^.y\ 

Bev^ jeB bgdi jeB Bev^ 

where we recall that X^B) = (E,^b A^p/I^l = {E,eB 
So, we can conclude that 

A'' • y'^dri > [ X^ ■ y^drj = [ A° • y^dr] = A° • = A • y. 

r?=0 Jti=0 Jri=0 

In light of the above, it remains to prove (21). To this end, recall that when and A'' are fixed, 
the evolution of y^^s is described by Equation (13), thus the statement we need to prove is 

E ^{yh + (')i^(^)-<^) ^ 0- (22) 

{i,j)eAv ' 

Plugging in the expression for N(r]) given by (14) and canceling a, we need to show that 



Jij)£Av / \{i,j)eAv 



Now, this inequality follows from the Cauchy-Schwarz inequality (a • b) < \a\2\b\2, by taking 



a to be the vector with entries atj = J —^^ — ( y'-j + (3), and b to be the vector with entries 
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Charging the movement cost. We turn our attention to the accounting of the movement cost 
of our algorithm. RecaU that the movement cost at time t is defined as 




(23) 

We would like to approximate this expression by one which is simpler and more convenient to work 
with. First, instead of keeping track of both increases and decreases of the variables ylj as in the 
above, we will account for the movement cost only through the increases of the variables j. That 
is, our bound for the movement cost is 

d 

(24) 




Note that we have for any coordinate and t >1 

Ivij - yl/\ < 2 • max{y*^ - yl/,0} + yj;/ - yjj, 
and thus on any input sequence consisting of T requests, it is the case that 



EE-^ Ei^-^G^i I <2-EE-. EmaxK,-,-\o}) [Eyl^-yl, 

t=l i=l \j=l J t=l i=l \j=l ) i=\ \i=l 

Thus, accounting for the movement cost via expression (24) approximates the true movement cost 
(corresponding to expression (23)) up to a multiplicative factor of two and an additive factor of 
at most X^f^x ^"^i^ which depends only on the starting and final configuration, and is zero if the 
two configurations coincide. As we will see, for the sake of our competitive analysis, such an 
approximation suffices. 

Next, similarly to the way we accounted for the hit cost described above, we wish to further 
simplify the charging of the movement cost and perform it in a continuous and local fashion. 
Namely, it is easy to see that the following quantity 

1 . rl Ai!^ 




dT + / max{— -^,0}(ir/ 
Jv=o dr] 



r=0 dT Jri=0 

can only overestimate the movement cost given by (24). (Note that is always non-negative. 
Furthermore, as the derivatives can only be positive if ^ A^, we can write 



1 

ri=0 



dy^ 

drj 



max { , 



where we used (13) and the fact that, trivially, N[ri) — }^ ■ < N(r]). Thus, we can use the above 
in our final version of the estimate of the movement cost. The following claim summarizes the 
discussion. 
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Claim 13. For any sequence of requests of length T > 1, 
T d / k \ 

EE-. EW.-!/.'-}'! £ 

i=i i=i \j=i j 

^ ( ^ du^ ■ 1 / N \ 

2 ■ E E ». g /„„ + ^M'^^ ^'"^ ■ j + ^' 

where C < Y^- kwi depends only on the starting and final configuration of the algorithm, and C" = 
if the two configurations coincide. 

Thus, at time t, the movement cost of our algorithm is given by: 

E (e ll + /I i + /j) "0) • ■ (25) 

3.3 Competitive Analysis 

We are finally ready to bound the competitiveness of our algorithm. To this end, we prove the 
following theorem. 

Theorem 14. Consider an arbitrary instance of the allocation problem with cost vectors h^^K^ , . . ., 
a starting configuration Ip and a quota pattern k = k(2), . . .). For any < e < 1, we have 

the following bounds: 

H < {I + e) {Opt + w„,^,-g{k)) + C, 
M < 0(log(A:/e)) • (Opt + lo^ax • • 

Here, H and M denote the hit and movement costs of our fractional algorithm, and Opt denotes 
the sum of the total hit and movement costs of a fixed integral optimum solution to the allocation 
problem instance. Let g{K) := [^(i) — ^(t — 1)|, and denote by Wmax = maxj the diameter of 
our metric space. Let C be a quantity that depends only on the start and final configurations of the 
online algorithm, and C = if the two configurations coincide. 

It is easy to see that Theorem 5 immediately follows from the above theorem. 

To prove Theorem 14 we employ a potential function approach. Namely, we define potentials 
^^{y,t) and ^"^{y,t) that depend on the state y of the online algorithm and on the state of some 
arbitrary fixed integral optimum solution at time t. Then we show that the following inequalities 
are satisfied at each time step t. 



Mf + A$r < (l + e)-a-(w^max-|At(t)-K(t-l)j+M% + H%), (26) 

Rt + A^t+-^^T < {l + e){w^,^-\K{t)-K{t-l)\ + M*t + R*t). (27) 

a 

Here, (respectively H*^) and (respectively M*^) denote the hit and movement costs 
incurred by the algorithm (respectively optimum) at time t. The quantities 
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respectively denote the change in the potentials $ and $™ at time step t. 

As we shall see, it will be the case that t) > and <I>™(y'^,0) = 0. Moreover, both 

$"^(y, t) and ^^{y,t) will be bounded by some universal constant C, independent of the length of 
the request sequence. Thus, Theorem 14 will follow by summing up (26) and (27) over all times t. 
(Note that in our algorithm a = log(l + 1/13) = 0(log(/c/e)).) 



The potential functions. The potential function <I>™ is defined as follows. 

1 + /3 



Here, y** denotes the configuration of the optimum solution at time t. Note that if the configuration 
and 1 

location i to $"*(y*,t) is 



y** is integral and the optimum has servers at location i at time t, then the contribution of 




So, roughly speaking, $™'(y*,t) accounts for the excess servers in the online configuration 
at location i compared to the optimum solution. For example, suppose that y* has ki servers at 
location i, i.e., y*^ = for j < ki, for some ki > k* , and yjj = 1 otherwise. Then the contribution 
of location i to <I>™(y*,t) is 0{wi{ki — k*) log /c). Intuitively, the offline adversary can penalize the 
online algorithm for "wasting" ki — k* servers at i, by giving cost vectors at the other locations 
(where the optimum has more servers), and making it pay a larger hit cost. The potential $™(y*, t) 
will be used to offset this additional hit cost in such situations. 

Next, we define the potential to be 



It is easily verified that both potentials are bounded. Moreover $"^(y'',0) = 0, as both offline and 
online are assumed to start with the same initial configuration. 



The proof plan. Our goal now is to show that Inequalities (26) and (27) always hold. For ease 
of analysis, we will consider the events at time t in three steps, and show that Inequalities (26) and 
(27) hold at each of these steps. The steps are the following: 

1. The quota K,{t) either increases, decreases, or stays unchanged, compared to n{t — 1), and the 
optimal solution removes/adds servers accordingly. 

2. The optimal solution moves some servers and its state changes from y**~^ to y**. 

3. The online algorithm changes its state from y*~^ to y*: first, we analyze the fix stage then 
we analyze the hit stage. Also, while analyzing the hit stage, the hit costs of both the online 
algorithm and the optimal solution are accounted for. 
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The server quota increases/decreases and the optimum removes/adds servers. Note 
that the only quantities that can change in this step are the movement cost of the optimum and 
the potential thus it suffices to prove that Inequality (26) is preserved (in this case, (27) is 
identical to (26) scaled by 1/a). Now, there are two cases to consider, depending on how ti{t) 
changes. 

• Suppose «;(t) < K(t— 1), then the optimum has to withdraw \K{t) — K{t — 1) \ servers from some 
locations. That is, for \K{t) — K{t — 1)| locations the corresponding variables y*^j are 

set to 1, and as a result these locations start contributing to Clearly, each such location 
(i,j) increases $™' by at most 

(1 + e)Wi ■ log ( }t ^ ^ ) < (1 + e) In ( 1 + ^ ) • ^fmax = (1 + e)a • l^max- 

As a result, the total increase of is a.t most tt'rnax * 

(1 + e)a ■ \n{t) — K{t — 1)|, and hence 
(26) holds even without accounting for the movement cost of the optimum. 



Suppose K(t) > K{t — 1), then the optimum can bring in |K(t) — K(t — 1)| servers, and as a result 
can only decrease, as some terms y^ - may change from 1 to 0, thus not contributing 
anymore to 4'"'. Hence, (26) also holds in this case. 



The optimum moves its servers. Without loss of generality, it suffices to analyze the case in 
which the optimum moves exactly one server from location i to i' . (If multiple servers are moved, 
we can consider these moves one by one.) Also, as before, only <!>'" and the offiine movement cost 
change, and hence it suffices to just show that (26) holds, and in particular that 

< {l + e)aM*t. 

Suppose that location i had j servers prior to the movementand this number is reduced to j — 1 
(recall that by our convention we account only for the movement cost corresponding to withdrawal 
of servers). Then, the contribution of location i to ^>'" increases by precisely 

Wiil + e) • log [ t-t\ ] ^ M'^ + e)-a = {l + e)aM*t. 

\yi,j +pj 

In contrast, increasing the number of servers at i' can only decrease Thus, we get that the 
desired inequalities hold in this case. 



The online algorithm is executed. The case in which the online algorithm changes its distri- 
bution is the most interesting, and we analyze Inequalities (26) and (27) separately. Recall that 
our online algorithm works in two steps: the fix stage and the hit stage, and hence we consider 
these separately. Moreover, as the evolution of the algorithm is described in a continuous manner, 
we will analyze Inequalities (26) and (27) in such a manner too. 
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The fix stage: proof of Inequality (26). To show that Inequahty (26) holds during the fix 
stage it suffices to prove that for any r, 

dMt d^"" , , 

— ^ + -— < 0. (28) 
ar ar 

By definition of during the fix stage and our way of accounting for the movementcost (cf. 
Claim 13) we have 



dr Wj 

(ij)eA- (i,j)GA- 

(Recall that A"^ is the set of active coordinates i.e. those for which t/ij < 1.) Also, it is easy 

to see that the change in the potential function is 

^ = -(l + e) E 

ii,j)eA^ 

Next we need to prove the following claim 

Claim 15. Consider a subset A of the coordinates and two configurations y andy' with j^^^ y[ j > 
1, and Y^- ■ y[ j > kd — k, we have then that 

(yij + /3) - (1 + e) y'ij^ E E y'ij- 

{i,j)eA (i,j)&A {i,j)eA {i,j)eA 

Before we prove this claim, let us describe how (28) follows from it. If we set y = y'^ , y' = y**, 
and A = A'^, then, clearly Xli j Vi^'j = kd — K{t) > kd — k. Furthermore, since {i,j) ^ A'^ only if 
yjj = 1, and as we apply the fix stage only if YlijUlj < kd — K,{t), we need to have \A'^\ > K{t), 
and thus 

Y y*!j > kd - K{t) - \A^\ > 1. 
So, both requirements of the claim are satisfied and it follows that 

^ + ^= E E y*:^^ E yi.- E ^^^^o, (29) 



where the last inequality follows, as j yjj < kd — K{t) = j y* j-, and {i,j) ^ A'^ only if yjj = 1 
implies that J2{i,j)eA- Vlj < ^{i,j)eA- Vij- 

Proof of Claim 15. As '^ZijV'i,] > kd — k and j)(=,Ay'i,j — have that 

^ - E yi^ ^ ^ - "^^^^1^1 - i> ^ 0' ^30) 

iid)eA 

where the last inequality follows as < 1 when \A\ < k, and < |^| — k when |^| > A; + 1. 
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Now, the claim is proved by noticing that 



+ I E y^'i- E y'i^i 



^ E y^'j- E 

{«j)eA (i,j)eA 



where the inequahty follows by (30). □ 

The fix stage, proof of Inequality (27). To prove that Inequality (27) holds during the fix 
stage as well, we note that as we are accounting for the hit cost during the hit stage, currently 
there is no hit cost incurred, and hence we just need to show that 

d^^ 1 d^"' , , 

— + --— = 0. 31 
ar a dr 

Observe that 



Wi- 



dr a ^ dr 

(i,j)eA^ 

which is exactly 1/a times our accounting for the movement cost in the fix stage (cf. Claim 13). 
Therefore, (31) is identical to (29) (up to a scaling by 1/a), and hence follows from the above proof. 

The hit stage, proof of Inequality (27). Recall that both the optimal solution and the online 
algorithm incur a hit cost in this stage. We start by proving that Inequality (27) holds during this 
stage - later, we will analyze Inequality (26). We need to show that 

dHt d^'} 1 d^Y" , ^ dH* , , 

— ^ + + -— ^ < (1 + e)—^. (32) 
a?/ dr] a dij drj 

First, we note that by our way of accounting for the hit cost of the algorithm (cf. (16)) 

iJ {■i,j)&Av 

Now, we have that 

= E K^y^. + l E iyh+^)-^iv)- E n 

{i,j)tA^ (i,j)GA'7 {i,j)€Av 



^ E K^yh + ^ E (yh + ^)-^iv)= E >^A^ + 



1 dMt 



(i,j)^A'; (j,j)eA'7 {jj)^Ai 
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where in the last step we used the expression for our movement cost accounting (cf. Claim 13), 

dMt 



dri 

Thus, to establish (32), it suffices to show that 



E (yl + f3]-N{v). (33) 



(34) 



By our update rule for y'' ., the derivative of <1>™ is 



dri 

Finally, by our way of accounting for the hit cost of the optimal solution (cf. (15)) we have 

dH*^ 



dri 

Thus, after multiplying Inequality (34) by a and plugging in the above equalities, we need to prove 
that 

« E K^yh+ E {yl,+P)-N{ri)-{l+e) ytr{^{v)-a>^l^-{l+e)aY,\lyt,<Q- 

{id)tA-i {i,j)€Av (i,j)&Av i,j 

(35) 

Let A' be the set of all coordinates that are either active or have y^j = 0. As any inactive coordinate 
has y^j e {0, 1}, we observe that: 

• If (i,j) € A'\A^, then it must be that y^j = and ctX^j > N(rj). This holds, as a coordinate 
for which y^ j = can be inactive only if = {y^- + fi){N{r]) — aX) < 0. 

• If (i,j) ^ A' it must be that y^ ■ = 1 and oA['j < N{r]). This holds, as a coordinate for which 
y^j = 1 can be inactive only if = [y^ ■ + f3){N{7]) — aX) > 0. 

Thus, using the above observations, we may rewrite (35) and get that 



^(^)-| E (yl + (3)-{i+e) E -(i + ^)« E K,y*!j + » E K^yl 



< mv) ■ E (yl + /3) - (1 + ^) E y*!j - (i + E Kjy*!j + « E K^y 



{i,j)eA' {i,j)&A' I {i,MA' {i,mAv 



^ ^(^)-| E {yl + P)-i^ + ^) E E K^{yh-y*!j 

< ^(^)-| E + + E y*!j+ E {y^,j-y*!j 
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The first inequahty fohows as (i, j) G \ A^, then it must be that aX^j > N{rj), and because 
C A'. The second inequahty follows since for ah G A' \ A"^ it must be that ylj = 0. 

The third inequality follows as OiX^j < N{r]) for ^ A', and by the observation that for each 

{i,j)^A',yl^ = l>y*^.. 

If N{r]) = 0, then the above expression equals 0, and we are done. Otherwise, if N{rj) > 0, we 

get that 



^ E ^i;.- E ^5+ E 

(*,i)eA' {ij)6A' {i,MA' 

Inequality (36) follows from Claim 15 by the following arguments. ^From Lemma 10 we know that 
the set A^ is non-empty. We claim that this implies that ^j-j y^j has to be positive. Otherwise, 
all the active coordinates would have y^ ^ = 0, and thus could only increase, contradicting the fact 

that when N{rj) > 0, Eij^if = E(^J)6A'7 ^if = 0. Moreover, as N{ri) > 0, ZijVlj is equal 
to ^ijy** = kd — K{t) (and thus is integral). Now, as y^j = 1 if ^ A', we have that 

Y,{i,j)£A' Vij ^ Z](ij)eA' Vij ^^^^ that Y.{i,j)eA' ylj is integral as weh. As this quantity is positive 
by the argument above, we get that 

E E ^M-^i- (38) 

{jj)GA' (j,j)GA' 

Thus, we can use Claim 15 with y = Ij^, y' = y** and A = A' (as all the requirements of this 
claim are satisfied). The last equality follows, since for N{rj) > 0, we get from the algorithm, 

E.,yl, = kd-.it) = z.^^y*^. 



The hit stage, proof of Inequality (26). To show that (26) holds, we need to show that 

_ + _i_<(l + ,)„._ (39) 

However, this follows directly by noting that the above is simply Inequality (34) after removing the 
first (non-negative) term and scaling by a. 



4 Fractional /c-server on Weighted HSTs 

In this section, we show how the fractional allocation algorithm on a weighted star can be used as 
a building block to obtain a fractional fc-server solution on a weighted HST. In particular, we prove 
the following. 

Theorem 6. Let T he a weighted a -HST with depth i. If, for any < e < 1, there exists a 
{l + e,log{k/e))- competitive algorithm for the fractional allocation problem on a weighted star, then 
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there is an O{i\og{ki))-competitive algorithm for the fractional k-server problem on T, provided 

a = n{nog{ke)). 



To this end, we focus on a particular weighted cj-HST T and show how to construct a fractional 
k-seiver algorithm on it. Roughly speaking, the construction works as follows. Each internal node 
p of T will run a number of instances of the allocation problem which differ with respect to their 
quota patterns, but have the same hit cost vectors. These instances are maintained as a convex 
combination. The fractional solutions to the different instances, which we compute online using 
the fractional allocation algorithm, determine in a recursive manner how the servers available at 
each node are distributed among its children. 

While this approach is similar to the approach of Cote et al. [16], the main difference here is 
that we can use the (much weaker) fractional allocation problem instead of using a randomized 
(integral) algorithm for the allocation problem. 

Let us denote by r the root of our cj-HST T. Recall that for a node p of T, T{p) denotes the 
subtree rooted at p, W{p) is the length of the edge connecting p to its parent, and w{p,i) denotes 
the length of the edge connecting p to its child pi. By the definition of a weighted cr-HST, we have 
W{p) > aw{p, i) for all children i of p, unless p is either a leaf or the root. 

Recall that the input to the fractional allocation problem running at node p consists of the 
quota pattern k = (ft;(l),K(2), . . .) specifying the number of servers K{t) available at each time t, 
and the hit cost vectors /i* that arrive at each time t at location i*. The output of an algorithm for 
the fractional allocation problem specifies a fractional solution that provides a distribution on 
the number of servers at each location pj, subject to the aggregate bound of K{t) on the (expected) 
number of servers. 

Now, let us fix some instance p of the fc-server problem on the leaves of T. Let p = {p{l), p{2), . . .) 
be the request sequence, where p{t) denotes the leaf requested at time t. 

Definition 16. For a node p, integer j and time t, let Optcost(p, j • 1, t) he the optimum cost for 
serving the k-server instance {p(l), . . . ■:P{t)} n T{p), i.e. the request sequence p restricted to the 
leaves ofT{p) until time t, subject to the constraint that exactly j servers are available. 

Remark: Optcost is well defined only with respect to an initial configuration, which we will 
always assume to be the initial starting positions of the servers at i = 0. Also, we use the notation 
Optcost(p, J • l,t), instead of just Optcost (p, j, t), as we will later extend the definition of Optcost 
to the case in which j can vary with time. For now, we only consider fixed j. 

4.1 The algorithm 

In this section we define the ensemble of allocation problems that will be running at each of the 
internal nodes of T. To do this, we have to define how the hit-cost vectors and the quota patterns 
are generated. Consider some internal node p. As mentioned earlier, each internal node p will run 
several instances of the allocation problem that are different with respect to their quota pattern. 
It will also hold a convex combination over these instances. All instances will have the same hit 
cost vector that will be defined later. We denote the convex combination over allocation instances 
on node p at time t by A* . A* is specified via the collection 

= {{^p,s-. '^p,s-. Hp)}s, yt,p, ^ A* = 1 

s 
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Here A* ^ determines the fraction at time t given to the instance with quota pattern ^ (until 
time t), and Hp = {hp, hp, . . . , hp} is the sequence of hit cost vectors that have appeared until 
time t. As we will see shortly, the hit cost vector will be the same for all instances s € A* and 
therefore there is no subscript s to the hit cost. We will use the notation s E A* to index the triples 
{Xpg,K,pg,Hp) in A*. As we shall see later, the convex combination A* will be a refinement of the 
convex combination A*~^, for every t. 

To complete the description of the fractional /c-server algorithm we need to define A* for each 
node p and time t, and show how the fractional number of servers at the leaves of T is computed. 
We begin with defining how the hit costs /i* are generated for each node p. 

Hit costs: Consider any internal node p. Let pi, . . . ,pd be the children of p. For the allocation 
problems running at p, at time t we give the hit cost vector 

^p,(j) = Optcost(pi, j • 1,0 - Optcost(pi, j ■l,t- 1). 

As Cote et al. [16] prove, the cost vectors hp^ have the desired monotonicity property, i.e., hp^{0) > 
hp.{^) > • • • > hp.{k) for each i and time t. The following crucial observation follows directly from 
the definition of the A:-server problem. 

Observation 17. Consider subtree T{p) and request p{t). If p{t) € T[pi), then 

1. hp{i, 0) = oo. (This follows since any 0-server solution is infeasible for any instance with one 
or more requests, or equivalently incurs infinite cost.). 

2. hp{i' ,j) = for all i' ^ i and for all j. (This follows simply since the request is not in the 
.sub-tree of Pii for i' / i.). 

This completes the description of the cost vectors of node p. We next define the quota patterns 
Kp^sii) for the various allocation instances running at node p. 

Quota patterns: The quota patterns are determined recursively in a top down manner over 
the tree T (and inductively over time) by the fractional solutions of the allocation instances that 
are generated at each node. To specify how these patterns evolve, we describe below a procedure 
for updating both the quota patterns k^^^ and the convex combination A* associated with the 
allocation instances maintained. 

Base case: 

1. At the root r of the tree T there is a single allocation instance running with a quota of k at 
all times. That is, A* consists of a single allocation instance (with fraction 1), hit costs as 
described above, and k = k ■ 1. 

2. For any internal node p G T and time t = 0, A^ consists of a single allocation instance (with 
fraction 1). The quota pattern Kp^s for this single instance s, until time t = 0, is simply the 
number of servers present initially at the leaves of subtree T{p). Moreover, there is no hit 
cost thus far. 
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The inductive step: Consider time t. We describe the procedure to obtain A* from A*~^ in a top 
down manner on the tree as foUows. As the base case, recall that A* has already been determined 
for all t. Arguing inductively top down on the tree, suppose that A* has already been determined. 
Then, for the children pi, . . . ,pd of p, we determine A* as follows. 

Consider the allocation instances that are executed at node p. Let {xlj g}ij^s be the fractional 
solutions generated (by the allocation instances) at time t. The algorithm will maintain the fol- 
lowing consistency between the quota for servers available at pi and what the allocation problems 
running at the parent p determine. In particular. 



(Consistency) 



A 



*6A' I K*(t)=i 



A t 
— T ■ ■ 



(40) 



Also, for each child pi it should maintain 



(Convex combination) 



1. 



(41) 



Suppose that changes to xjj due to the execution of the allocation instances s G A* at 
time t. We show how to update A*, from A*"-*^ such that it remains consistent with (40) and (41). 
This update will be done in a natural (and cost-efficient) way. 

Consider first the cost paid by the convex combination of the allocation instances running at 
node p. The cost is 



E 

£<j 



> 



^w{p,i)Y^ 



X 



(42) 



where the inequality follows, since for any non-negative numbers pi, '^iPi\ai\ > | X^jPiOij. 

We note that the change from x*^^ to xjj can be decomposed into a collection of elementary 

moves in which zb5(i, j) units of mass are removed from Xij and put on Xijj-i, such that the total 

fractional movement cost remains the same. Thus, we can assume without loss of generality that 
t 

Consider an elementary move where x- 



and xl / differ by an elementary move. 



6 and x 



_i + S (all other types of 

elementary moves are handled analogously). To implement this, we choose an arbitrary 5 measure 
of allocation problems s € A*~^ with Ks{t — 1) = j and set Ks{t) to j — 1. For all other k^, we 
set Ks{t) = Ks{t — 1). After all entries are updated by applying the elementary moves, Ks{t) is 
determined. 

It is clear that this update rule maintains both (40) and (41). This completes the procedure for 
obtaining A* from A*~^. 



Obtaining the fractional fc-server solution: To complete the description of our algorithm we 
should describe how to determine the fractional number of servers at each leaf q at each time t. 
This is determined in a natural way using the following observation. Consider a leaf q and let p be 
its parent. Then, 

z{Q,t) ■= E -^*sE-^ ■ 4j> 
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is the number of servers at q at time t. Here, x^g j g is the probabihty of having j servers at q at 
time t, when the fractional allocation algorithm is applied to the allocation instance s € A*. 

4.2 Feasibility 

We first note that our fractional /c-server solution is feasible since it satisfies the following. 

Lemma 18. Whenever there is a k-server request p{t), then there is at least one server unit at the 
location p{t), i.e. -Zp(t),t > 1- This holds provided the total cost incurred by the allocation problems 
is finite. 

Proof. The lemma follows by the way the hit costs are generated. Suppose leaf q is requested at 
time t, and q is the i-th child of its parent p. Then, by observation 17 (part 1), the hit cost entry 
/i*(i,0) for every allocation instance running at p is oo. Thus, if the total cost of the allocation 
problems is finite, it must be that for each s € A*, the algorithm ensures that x^q^ = 0. Since, 
Si ^g,j,s = 1 ^' ^^'^ Es6A^ As = 1, it follows that z{q, t) = EsgA^ Y.j 3 ' x\^j,s ^1- 1^ 

Remark: Lemma 18 assumes that the total cost of the allocation problems is finite. Later on 
we show that the cost is in fact bounded by at most a polylogarithmic factor from the optimal 
/c-server cost, and hence finite. 

4.3 Performance analysis 

We first show that the cost of the fractional fc-server solution we generate (at the leaves of the tree) 
is at most the total convex combination cost of the allocation instances running on T. For a node 
p (not necessarily a leaf) in T, let z{p, t) denote the total (fractional) number of servers at time t 
at the leaves of the subtree T{p). The cost of the A;-server solution is 

j;j^VF(p)|z(p,t)-z(p,t-l)|. 

t p 

Lemma 19. The movement cost incurred by the fractional k-server solution is no more than the 
total movement cost incurred by the convex combination of the allocation instances running on 
internal nodes ofT. 

Proof. First, we claim that z{p,t) = X^^g^t A*K*(t). This follows from the consistency relation (40) 

we maintain, and our procedure for generating A^ from A*~^. In particular, ^^(i) is the number of 
servers available for the fractional allocation instance s running at p. Since the solution produced 
by the fractional allocation algorithm on this instance satisfies j ■x\ ^ ^ = Kg{t)J this implies 

that 

seAl seAl i j i j s'eA^jK*,(t)=j i s'eA^^ 

^Note that when we designed the fractional allocation algorithm in Section 3, we allowed it to deploy less servers 
than the current quota. As a result, when applying this algorithm here we could sometimes have J]]^ J ' ^\.j,a < 
K!s{t). To see that our analysis is still valid in this case, it suffices to consider a modified version of the tree T. 
In this version, each non-leaf node p of T would have a dummy leaf ip (that will never be requested in our input 
sequence) added as its child and set the length w{p,ip) of the corresponding edge to 0. Now, we would just make 
each instance of the fractional allocation run at each such p deposit any unused quota of servers at the leaf ip. Note 
that as w(p, ip) = 0, this depositing would not incur any additional movement cost and that the modified tree would 
still be a weighted cr-HST. 
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The second equality above follows from (40). Applying this iteratively, and noting that z{q,t) for 
a leaf q is simply Yls'eA^ ^s'^MO; it follows that z{p,t) = XIsgA* ^s^s(*)- 

Suppose that 6 = z(p,t) — z{p,t — 1) > server units are removed from the subtree T{p) (the 
case when 5 < is analogous). Let p' be the parent of p (note that p ^ r, since z{r,t) = k for 
all t). As z{p,t) = "^g^ji^t^ ^^'^Zjj • it follows that the allocation algorithm running on the 

instances s S A*, will incur a movement cost of at least 



> W{p) 



W{p) 



W{p) 



p 



j=l £<j 



t 



X. 



t-1 N 



j=0 



where we used the fact that k ■ ^ 



j=o ■^p,j,s 



k 



E^^E 

seA*, j=o 
p 

W{p) ■ \z{p,t) - z{p,t 



t 

P,j,s 



t-1 

>i=0 ■^p,j,s 



□ 



Given Lemma 18 and 19 above, it suffices to consider the total movement cost incurred by the 
allocations instances running on the tree T and compare it with the optimum k-seiver cost. This 
will be our goal in the following. We begin by defining a notion of optimum fc-server cost on a 
weighted ci-HST T when k varies over time. 

Definition 20. Let T{p) he the subtree rooted at p, and let k be a quota pattern. We define 
Optcost(p, K, i) as the optimum cost of serving the request sequence pr\T{p) until time step t 
subject to the constraint that K(t') servers are available at each time t' , for 1 < t' <t. 

We should be precise about the meaning of a k-server solution on T in the case K(t) can vary. 
First, at any time t' there should be one server unit at the requested location p{t'). The cost of 
the solution is the total movement cost of the servers. The servers are always located on the leaves 
of T. At time t, when the number of servers changes from n^t — 1) to n{t), we will require that 
K{t) — K(t — 1) servers enter (or leave, if K(t) < n{t — 1)) from the root of T. 

For a vector k, let us define g{n,t) = X]*'=i l'^(*') ~ ^{^' ~ ^)\- "^^^ following is a simple but 
very useful fact about Optcost, that we will need. 

Lemma 21. Let p he an internal node in T with children For any k-server request 

sequence p on the leaves of T and any quota pattern vector k, the following recurrence holds. 



Optcost (p, K, t) 



mm 



^1 vi'^d'Ei— 1 — 



E Optcost(pi, Ki,t) + w{p, i) E a{i^i^t) 



(43) 



Ki=l 



i=l 



Here, in the base case in which p is a leaf, define Optcost (p, k, t) = oo if there is some time 
t' < t such that p{t') = p and n{t') = 0. Otherwise, if n{t') > 1, whenever p{t') = p, define 
Optcost (p, K,t) = 0. 
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Proof. The condition X]^=i i^i = ensures consistency between the number of servers in T{p) and 
its subtrees T{pi). The term Optcost(pj, Kj, t) measures the cost of serving the requests within 
T{pi) and g{Ki,t) measures the cost of servers leaving or entering subtree T{pi). □ 

Next, we need the following key lemma that relates Optcost(p, k, t) to our procedure for gener- 
ating hit costs at node p. 

Lemma 22. Let p he a non-root node ofT. Given a quota pattern k forT{p), let Optcost(p, k, t) 
be as defined above. Then, 



Optcost(p, - hp{K{t')) 



t'=i 



<2-^-Wip)-giK,t), (44) 
a — I 



where hp{j) = Optcost(p, j • 1, t') — Optcost(p, j ■ 1, t' — 1) denotes the incremental cost of the optimal 
k-server solution for T{p) with exactly j servers. 

This lemma (for the case of HSTs) is implicit in the work of Cote et al. [16]. For completeness, 
and since we need the extension to weighted cj-HSTs, we give a proof of Lemma 22 in Appendix B. 
We are now ready to prove the following theorem. 

Theorem 23. Let T be a weighted a-HST with a > 9, depth I, and diameter A. Let p be a k-server 
request sequence on T, and k be the quota pattern. Consider the total movement cost incurred by 
the convex combination of the allocation instances running on nodes of T (based on the algorithm 
described in Section 4.1). This cost is no more than 

/3^(Optcost(r, K, oo) + A • g{K, oo)), 

where (5^ satisfies the recurrence = 7/3£_i + O (log(A;/e)), and /3o = 1- Here, e is any constant 
for which the fractional allocation algorithm is (1 + e, 0{[og{k/ e)))- competitive, and 

7 = (l + e)(l + f) +o{^\og{k/E) 

We first show how Theorem 23 implies Theorem 6. The recurrence (5^ < 7/3£_i + 0(log(fc/e)) 
in Theorem 23, together with /3q = 1, implies that 

/y+i _ 1 

f3i = Oi\og{k/e)) i 

V 7 - 1 

Choosing e = l/{4£), and provided a = 0(e^^ log(fc/e)) = Q{£log{k£)), we get that 7 < (1 + ^), 
and hence 

(3e = 0{nog{k/e)) = 0{nog{k£)). 

As g{k- 1, 00) = 0, this implies an 0(^log(A;^)) guarantee for a weighted cr-HST of depth £, provided 
a = Q{ilog{ki)). We now prove Theorem 23. 

Proof. (Theorem 23): We prove by induction on the depth of the tree. 
Base case: the theorem is clearly true for i = (i.e. a single point space). 
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Inductive step: suppose the theorem is true for weighted cj-HSTs of depth i — 1, and let T be 
a weighted cr-HST of depth i, rooted at r. Let Wi be the distance to the i-th. child of r, and let 
w = maxj Wi. Given k, consider some optimal solution for T that achieves value Optcost(r, k, oo). 
We also denote the total cost Optcost(r, k, oo) by Optcost(r, k), and g{K, oo) by g{K). Let k* 
be optimal vectors for the children pi of r corresponding to this solution. Since k* determines 
Optcost(r, k), by (43) we have 

Optcost(r, k) = (Optcost(pj, k*) + Wi ■ g{Ki)) ■ 

i 

By (44), for each child i, Optcost(pj, k*) > Hitcost(i, k*) — 2wi ■ g{K*)/{a — 1), where we denote 
Hitcost(i,K') = Thus, 



Optcost(r, k) > ^Hitcost(z, k,*) + Wi ■ ^1 



9{<) 



cr - 1, 

Multiplying throughout by {a — l)/{cr — 3), which is at most 1 + 3/a (as cr > 9), implies 

y^(Hitcost(i, K*) + Wi ■ gin*)) < - — - ■ Optcost(r, k) < ( 1 + -] ■ Optcost(r, k). (45) 
^-^ cr — 3 \ a J 

Consider the fractional allocation instance A running at r in our algorithm and let {x\j}ij be its 
solution at time step t. (Since there is only a single quota pattern K(r) at the root, we assume that 
there is only one instance running, and keep it notationally convenient.) By Theorem 14 and by 
(45), the hit cost incurred by A satisfies 

d k / d d \ 

ZlI]Z]4i^i(i) ^ {l + e)i J2mtcost{i, Hi*) + Y,w^ -giK*) +w -giK)] 

t i=l j=0 \i=l i=l / 

< (1 + e) ^1 + Optcost(r,K) + (1 + e)?i; •5(k), (46) 
and the movement cost satisfies 

d / d d \ 

Y.Y.'^'Yl \yij - yt/\ ^ 0{log{k/e)) Hitcost(i, O + Y,^i- aiO + w ■ g{K) 
t i=l j \i=l 1=1 / 

< 0{log{k/e)){Optcost{r,K) + w ■ g{K)). (47) 

Now, recall that each of the children pi, . . . ,pd of r is running a convex combination on allocation 
instances A-, the quota pattern of which is determined by {x-^ jjj-. So, the hit costs and movement 
costs of A (i.e. left hand sides of (46) and (47)) can be expressed alternately as follows. Since 
the quota patterns at pi maintain the invariant (40) throughout the algorithm, the total hit cost 
accumulated by A can be expressed as 



t i=l j=Q t i=l j=0 seA\,K.t(t)=j 

d 

= E E ArHitcost(i,C)- (48) 

i=l sGA°° 
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Henceforth, we use Aj and to denote A?° and Kf. By our cost preserving procedure for updating 
vlj-^ changes to xj j, 



Kg when X* changes to x* •, the movement cost incurred by A can be expressed as 



(50) 



d d 

i i=l j i=l t sgA* 

d 

= ^Wi^\sg{i^s)- (49) 

i=l s£ki 

Let us now consider the overall movement cost incurred by the convex combination of the allocation 
instances. This is equal to the movement cost for A (running at the root) plus the sum movement 
costs incurred within pi, . . . ,pd- By the induction hypothesis, the movement cost for each of these 
recursive algorithms that are run on subtrees T[pi) with quota pattern is at most 

Optcost(pi, Ks) H ■ g{Ki) 

a -I 

Thus, the total recursive cost is at most 

E E ( Optcost(pi, Ks) + — ^ • g{Ks) j 

< EE ( Hitcost(i, Ks) H • g{tis) j 

< (l + e)/3^_i(^(^l + ^^Optcost(r,K)+«;-5(K)^ +^^A,/3^_i^^-5(ks). (51) 

Here, (50) follows as Optcost(pi, k^) < Hitcost(i, k^) + 2wi/{(T — 1) by (44), and (51) follows from 
(48) and (46). Now, the total cost of movement of servers across the subtrees pi, ■ ■ ■ ,Pd is 

d 

"^Wi^ Xsg{Ks). (52) 
Adding up the costs from (51) and (52), the total cost incurred by the algorithm is at most 

(1 + e)^e-i ( (l + ^) Optcost(r, k)+w g{K)^ + E E + ' ^i^^^' ^^^^ 

By (49) and (47) we have 

d 

^WiY^ \sg{Ks) < 0(log(/c/e)) (Optcost(r, k) + w g{ti)) . (54) 

i=l sGAi 

Plugging (54) into (53) implies that the total cost is at most 

V^-i -7 + (log(A;/e))^ (^Optcost(r, k)+w g{K) 
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where 



7 = (1 + e) (l + - ) + 



( 



\og{k/e) 



a 



Thus, the claimed result follows. □ 

5 Weighted HSTs and Online Rounding 

In this section, we show how one can embed a cj-HST into a small depth weighted cr-HST with 
constant distortion, i.e., we prove Theorem 8. Also, we present an online rounding procedure for 
the fractional /s-server problem on an (un-weighted) cr-HST, that is, we establish Theorem 7. 

5.1 Embedding a-HSTs into Weighted a-HSTs: 

Theorem 8. Let T he a a-HST with n leaves, but possibly arbitrary depth. Then, T can be 
transformed into a weighted a-HST T such that: the leaves of T and T are identical, T has depth 
O(logre), and any leaf to leaf distance in T is distorted in T by a factor of at most 2a/ {a — 1). 

Proof. For a given rooted tree T', we say that it is balanced if: (1) there is no child p of the root 
such that the subtree T'{p) (rooted at p) contains more than half of the nodes of T', and (2) each 
subtree T'(p'), rooted at a child p' of the root, is balanced as well. It is easy to see that if a balanced 
T' has n leaves then its depth is O(logn). 

We now present a procedure that contracts some of the edges of T and yields a weighted cr-HST 
T' such that: (a) T' is balanced, and (b) for any leaf-to-leaf path in T, at least one (out of two) of 
the longest edges on this path has not been contracted in T'. 

The procedure works as follows. Let r be the root of T and pi, . . . ,pd he its children. We first 
recursively transform each of the trees T{pi) rooted at pi. Next, we check if there is a child pi such 
that T(pi) contains more than half of the nodes of T. (Note that there can be at most one such 
child.) If not, then T (with modified T(pi), . . . ,T{pd)) is the desired T' . Otherwise, T' is the tree 
T with the edge that connects pi to r contracted. 

It is easy to see that the tree T' obtained by the above procedure is indeed balanced and also 
the lengths of the edges on any root-to-leaf part decrease at rate of at least a. Thus, T' is a 
weighted cr-HST and (a) holds. Now, to prove (b), let us inductively assume that it holds for all 
the transformed subtrees T{pi). We prove it for the transformed T. Consider a leaf-to-leaf path in 
T. If the path is contained entirely in one of the subtrees T{pi), then we are done by our inductive 
assumption. Otherwise, the path contains two edges incident to the root r. As T is a cr-HST, these 
two edges must be the longest ones on this path. Thus, as the procedure could contract only one 
of them, (b) follows as well. 

Now, clearly, taking T to be T' satisfies the first two desired properties of T, as stated in the 
theorem. To prove that the last one holds too, we note that the length of any leaf-to-leaf path can 
only decrease in T (as we only contract edges). However, as we retain at least one of the longest 
edges, we have that the worst case distortion it incurs is at most: 




j=0 



j=0 



where a is the length of the longest edge on the path. The theorem thus follows. 



□ 
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5.2 Rounding the Fractional fc-server Solution Online 

We now show how to obtain an onUne randomized (integral) fc-server algorithm from a fractional 
k-server algorithm, when the underlying metric corresponds to a cr-HST T. The competitiveness of 
the obtained algorithm will only be an 0(1) factor worse than the competitiveness of the fractional 
algorithm, provided a > 5. The rounding procedure builds on ideas in [11] which were developed 
in the context of the finely competitive paging problem, and extends those ideas from a uniform 
metric to HSTs. 

Let 1, . . . , n denote the leaves of the cr-HST T. Recall that the fractional solution to the /c-server 
problem specifies at each time t the probability of having a server at leaf i. The variables 
satisfy x- = k.^ When these probabilities change at each time step t, the movement cost paid by 
the fractional algorithm is equal to the earthmover distance between the distribution x*~^ and x*. 
In contrast, an execution of a randomized algorithm can be viewed as an evolution of a distribution 
over /c-tuples of leaves. (There is no point in having more than one server at a leaf.) To make this 
more precise, let us define a configuration to be a subset C of {1, . . . , n} of size exactly k. The state 
5* at a given time t of a randomized fc-server algorithm is specified by a probability distribution 
figt on the configurations, where fis{C) denotes the probability mass of configuration C in state S. 

Now, we say that a state S is consistent with a fractional state x if, 

(consistency) for each i G [n] Z^c ieC /^•S'(^) ~ (^^) 

i.e., if the marginal probabilities of the state S coincide with x. 

We therefore see that in order to round a fractional algorithm to a randomized algorithm, we 
need to devise a way of maintaining (in an online manner) a sequence of states S'^jS^,... that 
are always consistent with the corresponding states x^, x^, ... of the fractional algorithm, and such 
that the cost of the maintenance is within an 0(1) factor of the movement cost of the fractional 
algorithm. More precisely, our goal is to establish the following result. 

Theorem 24. Let T be a a-HST with n leaves, a > 5, and letlP^x^, ... be a sequence of states of a 
fractional k-server algorithm. There is an online procedure that maintains a sequence of randomized 
k-server states S^, S^, . . . with the following properties: 

1. At any time t, the state 5"* is consistent with the fractional state x^. 

2. If the fractional state changes from x*"-*^ to x* at time t, incurring a movement cost of Ct, 
then the state S^~^ can be modified to a state 5* while incurring a cost of 0{ct). 

The key idea in bounding the maintenance cost of our rounding in Theorem 7 is to require that 
the states 5* that we produce are not only consistent with x*, but also each configuration in the 
support of S* does not deviate much from the fractional state x*. To this end, we introduce the 
following additional property for fc-server states. 

For a node p ofT and a fractional state x, let Xp = X]iGT{p) -^i fractional amount of servers 

that X has on the leaves of the subtree T{p) rooted at p. Also, let np{C) = C PI T{p) denote the 
number of servers in configuration C on leaves of T(p). We say that a configuration C is balanced 
with respect to x if np[C) € {[^pj, [^p]} for every node p. Now, we say that a /c-server state S 

*Note that, in principle, the definition of a fractional solution allows us to have x\ to be strictly smaller than 
k. However, as the starting configuration Hp has exactly k servers, it is easy to modify our fractional solution so that 
it always has exactly k servers, while not increasing its movement cost. 
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is balanced with respect to x if every configuration in its support (i.e., with non-zero probabiHty 
mass) is balanced with respect to x. That is, for all p and C for which HsiC) > 0, 



[xp\ < np{C) < \xp] . 



(56) 



Now, our approach to making the states we work with balanced is facilitated by the following 
definition. Let x be a fractional state and let S be some k-seiver state consistent with x (but S 
might be not balanced with respect to x). We define the balance gap, G{S,x) of S, (with respect 
to x) to be: 



Here, W{p) denotes the length of the edge from p to its parent. Clearly, when S is balanced with 
respect to x, its balance gap is zero. Intuitively, the balance gap measures the distance of the state 
S from being balanced. This intuition is made concrete by the following lemma. 

Lemma 25. Let x be a fractional state and let S be a k-server state on the leaves of a a-HST T, 
with £7 > 5, which is consistent with x (but not necessarily balanced with respect to it). Then, S 
can be converted to another state S' which is both consistent and balanced with respect to x, while 
incurring a cost of 0{G{S,x)). 

We will prove Lemma 25 later. First we show show how Theorem 24 follows from it. 

Proof of Theorem 24- Consider a fractional state x that changes to some other fractional state x' , 
and let S" be a /c-server state which is both consistent and balanced with respect to x. As (the 
state initially at time t = 0) is consistent and balanced with respect to x^, it is easy to see that 
to establish the theorem, it suffices to show that for any x, x' , and S as above, there is a fc-server 
state S' which is consistent and balanced with respect to x', and such that the cost of changing the 
state S to state S' is within 0(1) factor of the cost of changing the state x to x'. Furthermore, it 
is enough to restrict oneself to the case in which x' is obtained from x by applying an elementary 
move, i.e. Xi, for some leaf i, is increased by 6 and Xj/, for some other leaf i', is decreased by 5, 
where 5 can be chosen to be infinitesimally small. 

In light of this, we can focus on proving the existence of such S' . To this end, let p denote the 
least common ancestor of i and i' . Note that in this case the fractional cost of changing x to x' is 
at least 25w{p), where w{p) = W{p)/a is the length of the edges from p to its children. 

Consider now the following transformation of the state S. First, we add the leaf i to a probability 
mass of 5 on arbitrarily chosen configurations in S that do not contain i already. Next, we remove i' 
from some probability mass 5 of configurations containing i' . (Note that the existence of sufficient 
mass of each type of configurations follows from the consistency property of S with x.) Let S be 
the resulting state. 

Before proceeding, we note that as 6 can be taken to be arbitrarily small, we can restrict our 
discussion to the case in which i is added to a mass of of a particular configuration C, and i' is 
removed from a mass of of a particular configuration C". 

Now, to continue the proof, we observe that S is consistent with x' . However, the modified 
configurations C and C that S contains are not legal anymore as they do not consist of exactly k 
leaves. Also, S might be unbalanced with respect to x' . 



G{S,x) = W{p) MC) min (|np(C) - [x^J |, |np(C) - [x^l |) . 



(57) 
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We show how we can modify S to fix these shortcomings. To this end, we note that as C contains 
i, and it was balanced with respect to x prior to adding i, it must hold now that np{C) > [xp\ + 1. 
Similarly, for C, it must hold that np{C') < [xp\ < [xpj + 1 = np{C). Thus, by the pigeon hole 
principle, there must exist a leaf j of T{p) which is contained in C, but not in C. Let us modify 
S by removing j from C and adding it to C. Clearly, this makes all the configurations in S legal 
again, keeps S consistent with x', and the total movement cost corresponding to this modification 
(due to deleting i, adding i' , and swapping j) is at most 4:6w{p)a/{a — 1) = 0{6w{p)), for cj > 5, 
which is within an 0(1) factor of the cost of changing xtox'. 

Unfortunately, S might still be unbalanced with respect to x' . To bound the imbalance, let us 
first consider the case in which [xgj = [x^J and \xq] = [x^] for all nodes q. This implies that 
all the configurations in S other than the modified configurations C and C are already balanced 
with respect to x' as they were balanced with respect to x. Now, we note that Xq ^ x^ only for 
nodes q that are on the path between i and i' (but excluding p). Similarly, nq{C) and nq{C') could 
change only for nodes on the path from p to i, i', or j (but, again, excluding p). Therefore, as both 
C and C were initially balanced with respect to x, we can conclude that the total imbalance gap 
G{S,x) = G{S,x') of S after our modifications is at most: 

3 • 26w{p) + i + + . . = 0{6w{p)). 

Thus, by applying Lemma 25 to S, we obtain a state 5' that is consistent and balanced with respect 
to x' and the cost of this procedure is 0{6w{p)), as desired. 

Now, it remains to deal with the case in which either [xgj 7^ L^^J, or [xg] 7^ l^'ql, for some 
nodes q. To this end, we note that by taking 5 to be small enough (but non-zero), we can ensure 
that for each q for which at least one of these two inequalities holds, it must be the case that 
either Xq or x^ is an integer. In the former case, we have that for all the configurations C" in S 
that have non-zero mass, nq{C") = Xq = [xgj = [xg] and thus [x^J < nq{C") < [x^]. In the 
latter ' I < and for every relevant configuration C" in S, [xgj < nq{C") < [xg], 

the total probability mass of configurations C" in S, such that nq{C") = \xq\ < [x^J = x'q or 
nq{C") = \xq~\ > \x'q \ = x'q, Can be at most 6. 

As a result, we see that the total probability mass of configurations in S that are not balanced 
with respect to x' is at most 35 (the contribution of 25 comes from the modified configurations C 
and C). Thus, by calculating the imbalance gap similarly to what we did before, we can show that 
G{S,x') is 0{5w{p)), and once again use Lemma 25 to obtain the desired S'. This concludes the 
proof of the theorem. □ 

It remains to prove Lemma 25. 

Proof of Lemma 25. Let us call a node p in our it-HST T imbalanced if 

Ms(C') min(|np(C) - [ (C7)-[xpl|)>0. 

c&s 

If no node is imbalanced, then clearly G{S, x) = 0, and we are already done, so we assume that 
this is not the case. Let p be an imbalanced nodes which is at the highest level of T (breaking ties 
arbitrarily). We note that p cannot be the root r of T, as each configuration has exactly k servers, 
and Xr = k. 



38 



Consider now a configuration C for which fis{C) > and np{C) ^ {[^pj, \xp\} and let us 
assume that np{C) < \xp\ (the other case can be treated similarly). As S is consistent with x, 
YliC" ^^s{C")np{C") = Xp, and so there must be some other configuration C with fis{C') > such 
that np{C') > [xp\ + 1. So, in particular, we have np{C') — np{C) > 2. 

Now, let p denote the parent of p in T (recall that p is not the root). As we choose p to be an 
imbalanced node at the highest possible level, p must be balanced, and hence \np(C') — np{C)\ < 1. 
But, since np{C') — np{C) > 2, it implies the existence of some other child p', p' 7^ p, oip such that 
npi{C') < npi{C). 

Therefore, by the pigeon hole principle, there must exist a leaf i in the subtree T{p) rooted at 
p which is contained in C", but not in C . Similarly, C must contain a leaf i' in T{p') which is not 
contained in C' . Let 5 = mm{fis{C), fis{C')) (note that 6 > 0). Consider a modification of S in 
which we take any arbitrary probability mass 5 of configurations C and replace i' by i in them. 
Next, we take any arbitrary probability mass 5 of configurations C and replace i by i' in them. 

Let us summarize the properties satisfied by S after this modification. First, the state remains 
consistent with the fractional solution x, because the marginals of the leaves i and i' have not 
changed. Second, since neither np{C) nor np{C') have changed, p remains balanced. Moreover, the 
only nodes for which the imbalance could have changed are on the path from i to p and i' to p'. 
Third, replacing i' with i (in a (5 measure of C) increases np{C) by 1 for these configurations, and 
replacing i with i' , leaves the quantity np{C') to be of value at least [xpj. Together, this implies 
that the imbalance of p decreases by at least 6. Finally, as npi[C) > np'{C') before the modification, 
the imbalance '^c"es ^^s{C") min(|np'(C") — \xpi\ |, \npi{C") — [xp/] |) of p' can only decrease. 

Now, if we analyze the change in the imbalance gap of S, in the worst case, the imbalance of 
every node from p to i (excluding p) could have increased by 25 due to the addition of i in C or 
removal of i from C . Similarly, the imbalance of every node from p' to i' (excluding p') could have 
increased by up to 26. Together with the above observations, this implies that the imbalance gap 
of S decreases by at least 

W{p)5 - A5w{p) (^1 + 1 + + . . = W{p)5 = ^{W{p)5), 

where the last inequality uses the fact that a > 5. 

On the other hand, as both i and i' lie in T[p), the movement cost incurred in the above 
procedure is at most A6w{p){a / {a — 1)) = A5W{p) (^JLi) ? which is within 0(1) factor of the reduction 
in the imbalance gap. The lemma follows by applying the above steps repeatedly until the imbalance 
gap reaches zero. □ 
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A Proof of Lemma 11 

Consider first the simpler case of the fix stage. Here the variables evolve according to 



As the derivative of yjj only depends on yjj and is continuous, the function yjj is well defined. As 
the derivative is when yjj = 1, and non- negative otherwise, it ensures that yjj always stays in 
the range [0, 1]. Finally, the monotonicity property holds here, as it holds initially when r = 0, and 
whenever yij < yij/ for some j < j', we have that dyj^/dr < dyJ-,/dT, unless y^ ■, = 1, in which 
case monotonicity holds trivially. 
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We now consider the hit stage. Recall that configuration y^!-, for each («, j), evolves according 
to Equation (13), which we reproduce here for convenience: 



^ ^ f if (i,j) 

~ 1 ^ [yh + ■ (^(^) - "^Ki) otherwise. 



(58) 

In the above, the set denotes the active coordinates at time rj (cf. Definition 9) and the 
normalization factor N{rj) can be expressed as (cf. (14)) 

fO if E,.i -«(*). 

iV(,) = E^,...i(.W. ^^^^ ^ _ (59) 

First, we show that during the hit stage blocks never split. To this end, we note that when two 
blocks merge, their y-values are identical, and since we also modify A'' to be identical for these 
blocks, all the variables contained in the merged block evolve in the same way from that point on, 
as desired. (Note that we do not assume here that the trajectory Jp is well defined and unique, we 
just argue that any trajectory compatible with our definition of derivatives cannot split blocks.) 

Now, we proceed to analyzing the properties of the evolution described by Equations (58) and 
(59). As a first step, let us prove the following claim that will be helpful later. 

Claim 26. Consider a feasible configuration If' , for ?/ G [0, 1], a subset of coordinates A, and a 
hit cost vector A. Define for any r] > r]' , 



Erj 
{i,j)&A \j 



where j = [Vij + j ■ ^ow, if we make the configuration y^ evolve according to 

dy^ ^ f tf{i,j)^A, 
dr] I -^^ (^y'j'j + 13^ ■ {NAiv) - aXij) otherwise ' 

then we have that A^a(??) does not increase, i.e., NA_{rj) < NA{rj') for any rj > rj' . 

Proof. Let us fix a 77 > rj' and denote rj'^ = rj + drj. We will prove that NA{rj'^) < NA{r]), which, in 
turn, implies our claim. To this end, note that one can view NA_{r]) as a weighted average, over all 
coordinates in A, of the value of aXij, where ^ is the weight which we attribute to coordinate 
{i,j) at time r]. 

Now, the key observation is that the way the yijs evolve implies that y^^ > y^ ■ if aAjj < NA{ri), 

and y^ ■ < y^ ■ if aAjj > NA{r]). So, as the weights u'-j are directly proportional to y]'j, we can 
conclude that during our evolution, the weights of coordinates that have a value of aXij above the 

average value NA{r]) (i.e. j < ■ in this case) can only decrease, and the weights of coordinates 

that have a value of aAjj which is at most the average (i.e., we have u^^ > ul ■ for such {i,j)) can 
only increase. As a result, we can express N^ir]^) as 
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where Ajj = vP^ ■ — vP^ ■ and, by our discussion above, we have that Ajj > if aXij < NaIi]), and 
Ajj- < otherwise. 

Now, if the right hand side of (60) is at most NAi'r])^ then we are done. Otherwise, we must 
have that 

2^{ij)eA'^i,j 

as it is easy to check that if "g^^* > c, then also | > c, as long as t and d are such that t > if 
d < c, and t < otherwise. 

But, the left hand side of (61) is by definition equal to N^ir])- So, the obtained contradiction 
implies that indeed NA{r]^) < iV^(ry), and the claim follows. □ 

Now, observe that the set A"^, as well as N{7]), depend only on the state y^' and the hit cost 
vector A^. As a result, both and N{r]) can, in principle, vary drastically between points of time. 
However, as we show in the following claim, "locally" they tend to behave in a regular manner. 

Claim 27. For any rj' £ [0,1) and feasible configuration y^' , there exists an tj" , r( < ry" < 1, such 
that: 

(a) = A^ for each rj G [i]' , t]"), i.e., the set A'^ of active coordinates does not change for -q < rj" ; 
(h) \^ = for any i] € [r]',r]"), i.e., there are no block merges until time rj" ; 

(c) the sum Xli j vlj bigger than kd — K{t) for all rj < rj" , unless it was already equal to kd — n{t) 
at time tj' ; 

(d) the configuration y"^" is well defined and feasible for rj € [r]',ri"] (note that this interval contains 
r,"). 

Furthermore, one can assume rj" is maximal, that is, unless ij" is already equal to 1, there exists 
no larger value of rj" for which the above holds. 

For future reference, we call rj" , defined as in the above claim, the horizon of y^' . 

Proof of Claim 27. Let us define a new process for the evolution of y^j in a suitably small neigh- 
borhood of rj' as follows: 

I otherwise 



where 



Nir,) 



if E:,j yt>kd-^it), 



While the new process looks similar to the original one, there are some crucial differences. First, we 
do not care if a variable y^ ■ either exceeds 1 or becomes negative, i.e., the active set is not updated 

as rj progresses and remains the set A^ . Second, the cost vector is not updated and it remains A'' , 
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i.e., blocks are not merged and the monotonicity requirement (4) are ignored. Third, the value of 
N(t]) is not changed once the value of J^ijUij ^i^s the quota. If YlijVlj > — K(t) at time rj' , 
N(r]) will be always 0. As a result, the latter sum might become less than kd — K{t) at some point. 

Now, the key observation that makes this new process useful to us is that it is identical to our 
original process, as long as the evolving configuration is still feasible, and no coordinate becomes 
active and changes in the original process. More precisely, the trajectories of the two processes 
coincide as long as in the configuration Ip evolving with respect to this new process: 

(1) y^j is in [0, 1] for each € A^' . (Otherwise, a coordinate violating this condition would 
have already become inactive in the original process.), or 

(2) N{r])—aX^j is non-negative (respectively non-positive) for ^ A^' withy^^- = 1 (respectively 

yf- = 0). (Otherwise, a coordinate violating this condition would have already become 
active and would have changed in the original process.), or 

(3) the monotonicity condition holds, i.e., y^ ■ < y^jj^i for each (i,j). (Otherwise, a block merge 
would have already happened in the original process.), or 

(4) the server quota is obeyed, i.e., YliijVlj ^ kd — K{t). (Otherwise, N{r]) would have already 
changed so as to guarantee that the number of servers stays at the quota.) 

Let T]^ for £ S {1, 2, 3, 4} be the last rj > t]' for which the £-t\i condition is satisfied in the interval 
[t]', t]]. Also, let rf (respectively rj^) be the first time rj > rj' in which y^j = (respectively, y^j = 1) 

for £ A^' (respectively ^ A"^'), and iV(r/) — j = 0. As in the new process N{r]) - and 
thus all the derivatives and the trajectory of If - are continuous and bounded in the our interval 
of interest^, such maximal r]^s and ff, r]^ always exists. Let us take rj" = minjmin^ 7/^,77'', 77^, 1}. 
Note that this implies that y'^ is well defined and feasible for any r] € [i]'iV"] " so, condition (d) 
is satisfied. Furthermore, it is not hard to verify that all of the conditions (a)-(c) hold for such 
r/", and that this r]" < 1 is indeed maximal with respect to satisfying these conditions. So, if we 
manage to prove that also rj" > rj' , our claim will follow. 

To this end, observe that by applying Claim 26 with A = A^ and A = A'' , we see that N{rj) 
can only decrease for r] > rj' . Therefore, all the derivatives (62) in this process can only decrease 
too. This implies, in particular, that if some variable stops increasing (i.e., its derivative becomes 
non-positive) at some point, it will never increase again. 

Now, note that if G A"^ then 

• if • G (0, 1), then, as the derivatives are bounded, it must be the case that y"^, G (0, 1) until 
some time rj > rj ; 

• if y'i'j = 0, then its derivative N{rj') — ctX^j at time rj' has to be strictly positive (otherwise, 
(i,j) would be inactive at time rj') and thus y^j (respectively its derivative) has to stay 
non-negative (respectively positive) until some time rj > rj' too; 

^More precisely, N{ri) is continuous and bounded, as long as each ■ is bounded away from — /3. However, we are 
interested in only analyzing the process as long as all the variables stay non-negative, so for the sake of our analysis, 
Niij) is indeed continuous and bounded. 
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• finally, if yj ■ = 1, then its derivative N{r]') — ctX^j at time ?/ has to be non-positive (as oth- 
erwise, ^ A^' ). However, by our discussion above, it means that neither this derivative, 
nor J, will ever increase again. 

Thus, we can infer from the above that both r]^ and rf are strictly larger than 7]'. 
Next, let us focus on some ^ A"^ . We have that 

• if y^j = 0, then N{r]') — oiX"^ ■ has to be non-positive (otherwise, (i, j) would be active at time 
T]') and as N{j]) never increases, N(ri') — oX^j will never become positive; 

• if y^j = 1, then N{r]') — aX^j at time r]' has to be strictly positive (as otherwise, (i, j) € A'^'). 
So, this quantity has to remain positive for some time rj > rj' . 

Thus, we see that both rj^ and T]^ are also strictly larger than rj'. 

Observe now that if y^ j = y^ for some + 1) G A^' , we must have that X^ j > X^ 

Otherwise, the blocks to which (i,j) and + 1) belong would have been merged at time rj' . 
This means that the derivative of y^j is always bounded from above by the derivative of y^jj^i- 
Thus, such a pair of coordinates will never violate the monotonicity property. On the other hand, 
if y^ ■ < y^ j^i, then, as the derivatives are bounded, there always exists f] > rj' for which this strict 
inequality still holds (and thus monotonicity is not violated). Hence, we get that 77^ > r]' . 

Finally, to see that > r/' as well, we note that if Ylij Vij = kd — K{t), then, by design, it will 
remain equal henceforth. If, however, the latter sum is larger than kd — n{t) at time rj' , then it 
would still remain so for some rj > r]' . 

Thus, indeed we have rj" = minjmin^ 7/^, rj'^, rj^, 1} > t]' , concluding the proof of the claim. □ 

In light of the above claim, one can consider obtaining a feasible configuration y^ from the 
starting (feasible) configuration if by simply gluing together the trajectories corresponding to the 
horizons. More precisely, one could start with rjQ = 0, and for each i]s, with s > and ijg < 1, define 
r]s-\-i to be the horizon of y^" . Note that, as we start with the feasible configuration if, Claim 27 
implies that all If" are well defined and feasible too. 

Now, the only reason why the above approach might not end up giving us the desired feasible 
configuration is that, a priori, it is not clear whether the sequence {??s}s ever reaches 1. That 
is, even though we know that 770 = and r/^+i > r]s, it might still be possible that this sequence 
converges without ever reaching 1, and thus there is no s with 77^ = 1. 

In order to rule out this possibility, we will prove that the total number of horizons is always 
finite. Observe that each horizon can be associated with at least one of the following events: (a) 
the set of active coordinates changes, or (b) a block merge occurs, or (c) the number of servers hits 
the quota. Thus, it suffices to show that the total number of such events is bounded. 

To this end, let us note that in our evolution, once Vij becomes equal to kd — K,{t), N{r]) 
is chosen so that Ylij '^vl j/df] = 0. So, once we hit the quota we stay there throughout the rest of 
the hit stage. Hence, there can be at most one event of type (c). Also, as we have already argued, 
during our evolution blocks never split once they are formed, and thus the total number of block 
merges (i.e., events of type (b)) can be at most k. 

It remains to bound the number of events of type (a), i.e., the ones corresponding to vari- 
ables becoming active/inactive. For notational convenience, let us say that a coordinate (i,j) 
0-inactivates (respectively, 1-inactivates) at time rjg, for some s > 1, if N{r]s) < aA^^ (respectively. 
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Nirjs) > aX^^j) and y'^^j = (respectively, y'^^j = 1), but {i ,j) was active at time r]s-i. We prove 
the following claim. 

Claim 28. N{r]) can increase only at a horizon, i.e., for any s > with rjg < 1, N{rjs) > N{ri) 
for rj € [rjs,rjs+\). Furthermore, if N{r}) indeed increases at time rjg^i, then at rjg^i we have an 
occurrence of either a block merge, or the quota is hit, or a 1-inactivation of some {i,j) with y^"- < 1. 

Proof. First, consider the case where at time rjg the number of servers is still below the quota. By 
(59), it means that N{t]) = = N{r]s), for rj € (?7s,?7s+i), and N{r]s+i) = unless the quota is hit 
at time ?7<j_|_i. So, the claim follows in this case, and in the rest of the proof we can assume that the 
number of servers is already at the quota at time r]s. 

First, we prove that N{r]s) > N{r]) for r] G [r]s,r]s+i), i.e., the first part of the claim. Let us 
fix some r] € [r]s,'i]s+i). Note that by the definition of the horizon, we have that = A^" and 
\v = \Vs _ go^ for our purposes, it suffices to show that whenever A^ = A^ = A, and X^ = X^ = A, 
for T]"*" = r/ + dr], we have that N[ri) > N{r]~^). This follows immediately from Claim 26. 

Now, to prove the second part of the claim, let us assume that none of the events mentioned 
in the statement of the claim occurred at time r/^+i, otherwise we are already done. So, we have, 
in particular, that A^^- = A^^^^ = Xij for each (i,j). This implies that if there is an {i,j) with 
^iT^ = y^j- = 1 that becomes active at time r/<j+i, then by Definition 9 it must be the case that 

iV(r?,)>aA:';^. = aA:';/^ >iV(7?,+i). 

So, in this case N(rjs) > N{rjs+i), and thus we can restrict ourselves to the scenario in which the 
only coordinates (i,j) that become active at time i]s+i have =0. As a result, we have 

where A^ is the set of coordinates (i,j) such that (i,j) 0-inactivates at time r]s+i, A^ contains 
{i,j)-s which 1-inactivate at that time and y^^- = 1, and A^ is the set of (i,j)-s that become active 
at time rjs+i with y^^^^ = 0. 

Now, observe that if some {i,j) G ^L, then we need to have OiXij > N{r]s). Otherwise, (i, j) 
would have already been inactive at time rjg. Furthermore, we actually need to have aXij = N(rjs), 
as otherwise the derivative of y^j would be negative in the interval [r]s,r]s+i), contradicting the fact 
that y^j-^^ = 1- (Recall that we have already proved that N{r]) - and thus all the derivatives ~ do 
not increase in the interval [r/^, r/^+i)-) 

So, by the above, and Definition 9, we can conclude that 

aXij > N{r]s+i) for each (i, j) € ^° , (64) 

aXij = N{r]s) for each (i,j) e A^_, (65) 

aXij < N{r]s+i) for each (i,j) e A^. (66) 

On the other hand, we can express N(r]s+i) as the weighted average of aAjjs over the set A^^+i 
(cf. Claim 26), i.e. we have 

l\[ris+i) - — -iji^n — , 
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where = ^ (yl^ + . 

Now, as = (A'''' \ (A° U ^L)) U we can utilize conditions (64) and (66) to bound 

N{r}p^i) from above by a corresponding weighted average of aXij-s over the set A^i'' \ ^L. In 
particular, we have 

_ T.(i,j)eA^s+i uf+^ ■ aXij _ 12{ij)s{(Avs\{A0_uAl))uA+) '^IT ' "^^J 
/^{i,j)&A^^+^ 2^{i,j)£{{Avs\(A0_uAl))uA+)^i,j 



^{i,j)e{Avs\A\) "tj 
l^{i,j)G(Ai.\Al)Ui,j 



< > (67) 



where the last inequality follows as for any a, 6 > 0, | < ^^^pf^rr^^j whenever ci > |, C2 < f , and 
ti>0,b>t2>0. 

If the last expression in (67) is at most N{7]s), then we are already done. So, let us assume, 
for the sake of contradiction, that it is strictly larger than N{r]s). In this case, by (66), we need to 
have that also 



as for any a, 6, c, t > 0, if | > c, then also > c. However, by applying Claim 26 with A = A^", 
we have that 

l^{i,j)GA^s «jj 2-y{i,j)GA^s "jj 

contradicting (68), and thus proving that indeed N{rjs) > N{i]s+i). The proof of the claim is now 
concluded. □ 

Now, we are ready to bound the number of events of type (a). To show that the number of 
these events is finite, it suffices to show that the number of 0-inactivations and 1-inactivations is 
finite. Also, observe that during the period in which the number of servers is below the quota, by 
definition, we have N{rj) = 0, and thus variables can only decrease. As a result, coordinates can 
only 0-inactivate in that period, and once they become inactive they stay that way. Hence, we have 
at most kd such events. 

In light of the above, we can focus on analyzing the events after reaching the quota. Note that in 
this case we can assume that N^i]) > 0. (If N{rj) = 0, then all derivatives are equal to zero, and the 
desired bounds trivially follow.) As we have that X^j is always zero when i ^ i, N{i]) > implies 
that coordinates with i ^ i can only increase, and once they 1-inactivate they stay inactive. 
As a consequence, it suffices to show that the number of 0-inactivations and 1-inactivations is finite 
for all coordinates (i,j) with i = i. In order to do so, we prove the following claim. 

Claim 29. The total number of 1-inactivations of coordinates {i,j) is finite. 

Proof. We will prove the claim first for j = k and then consider consecutive j-s in decreasing order. 
As a result, our task is to prove for a given j, that (i, j) 1-inactivates a finite number of times, 
provided that the number of 1-inactivations is finite for all coordinates with j < j' < k. 

To this end, we argue that whenever there are two consecutive 1-inactivations of some coordinate 
{i,j) - the first one at time rjgi, and the second one at time r/^// - then in the interval [r]s',r]s"] we 
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have either a block merge, or the quota is hit, or a 1-activation of a coordinate that has 

either i' ^ i or j' > j. As we know that the number of occurrences of each of these events is finite, 
we get the desired proof. 

To establish the above, let rfj for s' < s < s" be the time in which is activated between the 
two 1-inactivations. Observe that as N(rjs) < ct^j j and N{r]sf') > ctXjj, there is a time r]s* with 
s < s* < s" in which N{r]) increases above aAj^-. (Recall that by Claim 28 we know that N{r]) can 
increase only at horizons.) Without loss of generality we take s* to be the first s > s corresponding 
to such an increase. 

Now, in light of Claim 28, we know that N(r]) increases at time rjg*. Thus, to conclude our proof 
it suffices to show that if we have a coordinate that 1-inactivates at time tjs* and y/*ji^^ < 1) 

then we cannot have i' = i and j' < j. 

We consider two cases here. The first one corresponds to s* < s" . In this case we have 
yVa* ^ -|^^ Q^iierwisc would be 1-inactivated already at time s*, instead of s". However, by 

the monotonicity property (4), we have that y^^*, < y^"* for all j" < j. So, if i' = i, then 
cannot 1-inactivate at time rfs* if < j, and the claim follows. 

Consider now the remaining case of s* = s" . If we have i' = i and j' < j, then we must have 
Ajj/ > Xjj. Otherwise, condition (8) for block merge would trigger at time rjs". As a result, by 
(58), we know that in the interval of our interest the derivatives of ., are bounded from above by 
the derivatives oi ifl .. 

Furthermore, we have that the derivative of ifi . is always non-positive for rj G [r/g-, ??s")- This is 
SO, as by the definition of s*, N[ri) < c^Xj j for i] G [ijsiVs*) = [ils^Vs")- As a consequence, we must 
have both . and y^ ., to be equal to 1 for all r] G [rjs, %")> &s otherwise these variables would not 
be able to reach 1 at time i]s". This, however, contradicts the fact that y^/^j-r^ has to be strictly 
smaller than 1, as s* — 1 = s" — 1 > s. Thus, we cannot have i' = i and j' < j and our claim 
follows. □ 

Finally, it remains to bound the number of 0-activations of coordinates We do this by 

simply noting that if there are two consecutive 0-activations of some coordinate (i, j), then N{r]) 
has to increase at least once between these two events. But, by Claim 27, it means that one of the 
events, (whose total number is already bounded), would also occur in this period. Therefore, the 
number of 0-activations is also finite and we can conclude the proof of Lemma 11. 

B Proof of Lemma 22 

Here we prove Lemma 22. As mentioned earlier, the proof is implicit in the work of [16], and we 
make it explicit here for completeness. We begin with some notation, and state another result that 
we need. 

Let M be an arbitrary metric space. Let C[0] denote the configuration specifying the initial 
location of the /c-servers. We assume that the servers are labeled, so for every k' < k, the first 
k' entries of C[0] specify the location of the first k' servers. Let p be some fixed k-seiver request 
sequence. Let Opt(A;',X) denote the optimum cost of serving p with k' servers on M, starting in 
C[0] and ending in configuration X (for notational ease, we are suppressing the dependence on 
p,M,C[0] here). Let Opt(/c') = minx Opt{k' , X), denote the minimum cost of server p starting in 
C[0]. 
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Lemma 30 ([16], Corollary 2). Let p be some fixed request sequence and C[0] be some fixed initial 
configuration. For any ki,k2 G [k], given any state X on ki locations, there exists another state Y 
such that 

1. |X n y| = min(|X|, i.e. Y overlaps with X as much as possible, and 

2. Opt(/c2,y) < Opt(A;2) + Opt(A;i,X) — Opt(A:i). That is, the excess cost incurred for an 
optimum k2-server solution to end in Y, is no more than the excess cost incurred for the 
optimum ki-server to end in X . 

This lemma and its proof can be found in [16] (Corollary 2). 

Let T be a weighted cr-HST. Again, for notational convenience, let us drop p, C[0], and the 
underlying metric T from the notation (these remain the same, and dropping them will not cause 
any confusion). Given a quota pattern k, recall the definition of Optcost(K, t) as the optimum 
cost of serving p until time t with quota pattern k. We also use Optcost(K) = Optcost(K, oo) to 
denote the optimum cost of serving the entire sequence p. As previously, let us define /i*(k) = 
Optcost(K(t) ■ l,t) — Optcost(K(t) -1,^ — 1) and q^k) = J2t>i l^i'^) ~ ^i^ ~ 1)1- ^et D denote the 
diameter of T. 

We will prove the following, which is the same as Lemma 22. (In that notation, note that 
A < W{p)/{a - 1) for T{p).) 

Theorem 31. 

^ /i*(k) - A • g{K) < Optcost(K) < ^ /i*(k) + A • g{K). 
t t 

Proof. We do an induction on the value of g{n). In the base case, when g{K) = 0, the vector k is 
constant throughout, say k = k ■ 1. In this case, the claimed result holds trivially as the sum over 
/i* telescopes and we obtain 

^/i*(k) = ^(Optcost(A; •!,*)- Optcost(fc • l,t - 1)) = Optcost(K). 

t t 

So, let K be such that ^(k) > 0. Let r be the earliest time when k{t) ^ k{t + 1). Define a new 
quota pattern k' as 

k{t + 1) if t < r 
K{t) if t > r. 

Note that both n and k' are constant for t < t. Also, gi^') = g{n) — \k{t + 1) — k(t)| < g{K), and 
hence we can inductively assume that the claimed result holds for k' . 

We first show that Optcost(/t) > ^ih^i^-) — D ■ g{K). Fix some solution that attains cost 
Optcost(K) and let C[t] denote its configuration at time t. Applying Lemma 30 with X = C{t) 
and ki = k{t) and /c2 = k'{t), there is some configuration Y satisfying 

Optcost(/c2 • 1, r, Y) < Optcost(A;2 • 1, r) + Optcost(/ci • 1, r, X) - Optcost(A;i • 1, r) (69) 
\XnY\ = mm{ki,k2). (70) 

We construct a solution S' corresponding to k' as follows: Until time r, S' follows the solution 
Optcost(A:2 ■ l,''",^)- Then, after serving the request at t = r, it switches to state C[t] = X, and 
henceforth for t > t sets its the configurations C'[t] = C[t]. Now 

cost (5') = Optcost(A;2 • 1, r, Y) + c{Y, X) + Q, 



K'{t) 



49 



where c(Y, X) is the cost of moving from state Y to X, and Q is the contribution of the solution 
Optcost(K) starting from time r and state X (recall that C[t] = X). 

It is easily checked that the solution S' constructed above is feasible for quota pattern k' . As 
the optimum solution for k' can only be better, cost(S") > Optcost(K') and since Optcost(«;') > 

h^{K') — D ■ g{K.') by the inductive hypothesis, it follows that 

h\K') - D ■ g{K') < cost (5') = Optcost(A;2 • 1, r, Y) + c{Y, X) + Q. (71) 

t 

By (70), \Y nX\ = mm{ki,k2) and hence c{Y,X) < D\k2 - ki\ = D\k{t + 1) - K(r)|. Thus 
c{Y, X) + D ■ gin') < D ■ g{K), and hence (71) implies that 

Q>Y1 ^*^^') ~ ^ ■ f ('^^ ~ Optcost(A:2 • 1, T, Y). (72) 
t 

On the other hand, as X = C[t] we have 

Optcost(K;) = Optcost(K;, r, X) + Q = Optcost(/ci • 1, r, X) + Q. 

Thus by (72), 

Optcost(K) > Optcost(A;i • 1, r, X) - Optcost(/c2 • 1, r, F) + ^ h^n') - D ■ g{n) 

t 

> Optcost(A;i • 1, r) - Optcost(A:2 • 1, r) + ^ h^n') - D ■ g{K) (73) 

t 

= Y.h\^)-D-g{K), (74) 

t 

implying the desired lower bound. Here (73) follows from (69), and (74) follows from (73) since 

T 

^/i*(k) = Optcost(A;i • 1, r), 
t=i 

r 

^^/i*(k') = Optcost(A;2 • 1, r), 
t=i 

and /i*(k) — Ylt ^*('^') = SI=i ^*(^) — X]t=i since k, and k' are the same for any t > t. 

We now show show the upper bound on Optcost(K). The proof is similar to the one above. Let 
k' be defined as previously. Let {C"[t]}t denote the configurations for some fixed solution that has 
value Optcost(Av'). Applying Lemma 30 with X = C'[T],ki = k'{t) and /c2 = k(t), we obtain a 
configuration Y satisfying 

Optcost(A:2 • 1, T, Y) < Optcost(A;2 • 1, r) + Optcost(A;i • 1, r, X) - Optcost(A;i • 1, r) (75) 
\XnY\ = mm{ki,k2). (76) 

Consider the following solution S corresponding to k: Until time r, S mimics the solution Optcost(A;2' 
1, T, y). Then, after serving the request at t = r, it switches to state C'[t] = X, and henceforth 
for t > T sets its the configurations C[t] = C'[t]. Now 

cost (5) = Optcost(A;2 • 1, r, Y) + c{Y, X) + Q, 
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where Q is the cost of solution Optcost(K') incurred from time r starting at state Y . Again S is 
feasible for quota k, and hence Optcost(K) < cost(S'). By definition of X, 

Optcost(K') = Optcost(A;i • 1, r, X) + Q. 

Thus, 

Optcost(K) < Optcost(fc2 • 1, r, Y) + c(y, X) + Optcost(K') - Optcost(A;i • 1, r, X). 

As Optcost(K') < + by the inductive hypothesis and c{Y,X) + D-g{K') < D-g{K), 

we obtain 

Optcost(K) < Optcost(A:2 • 1, r, + ^ + D ■ c/(/t) - Optcost(A;i • 1, r, X) (77) 

t 

< Optcost(/E2 • 1, r) - Optcost(/ci • 1, r) + ^ - L» • g{K) (78) 

= j;/i*(A^)-Z)-5(A^), (79) 

t 

implying the desired inequality. Here (78) follows from (75), and (79) follows by noting that 

T T 

h\K') - h\n) = Y - Y h\K) = Optcost(fei • 1, r) - Optcost(A;2 • 1, r). 

t t t=i t=i 

□ 
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